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GENETIC SEQUENCES AND PROTEINS 
REZATED TO ALZHEIMER'S DISEASE, 
AND USES THEREFOR 

Cross Reference To Related Aonlieat jnr.B 

This application is a Continuation- In-Part of U.S. 
application Serial No. 08/509,359, filed on July 31, 1995, which 
is a Continuation-In-Part of U.S. application Serial No. 
08/496,841, filed on June 28, 1995, which is a Continuation-in- 
Part of U.S. Application Serial No. 08/431,048, filed on April 
28, 1995, all of which were entitled GENETIC SEQUENCES AND 
PROTEINS RELATED TO ALZHEIMER'S DISEASE (Inventors: Peter H. St. 
George-Hyslop, Johanna M. Ronnnens and Paul E. Eraser), and all of 
which are incorporated herein by reference. 

Field of the Invention 

The present invention relates generally to the field of 
neurological and physiological dysfxinctions associated with 
Alzheimer's Disease. More particularly, the invention is 
concerned with the identification, isolation and cloning of genes 
which are associated with Alzheimer's Disease, as well as their 
transcripts, gene products, associated sequence information, and 
related genes. The present invention also relates to methods for 
detecting and diagnosing carriers of normal and mutant alleles of 
these genes, to methods for detecting and diagnosing Alzheimer's 
Disease, to methods of identifying genes and proteins related to 
or interacting with the Alzheimer's genes and proteins, to 
methods of screening for potential therapeutics for Alzheimer's 
Disease, to methods of treatment for Alzheimer's Disease, and to 
cell lines and animal models useful in screening for and 
evaluating potentially useful therapies for Alzheimer's Disease. 

Backcrround of the Invent-.inn 

In order to facilitate reference to various journal 
articles, at listing of the articles is prxwided at the end of 
this specification. 

Alzheimer's Disease (AD) is a degenerative disorder of the 
hxOTian central nervous system characterized by progressive memory 
impairment and cognitive and intellectual decline during mid to 
late adult life (Katzman, 1986), The disease is accompanied by 
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a constellation of neuropathologic features principal among^ 
which are the presence of extracellular amyloid or senile plaques 
and the neurofibrillary degeneration of neurons. The etiology of 
this disease is complex, although in some families it appears to 
be inherited as an autosomal dominant trait. However, even 
amongst these inherited forms of AD. there are at least three 
different genes which confer inherited susceptibility to this 
disease (St. George-Hyslop et al., 199O) . The e4 (C112R) allelic 
polymorphism of the Apolipoprotein E (ApoE) gene has been 
associated with AD in a significant proportion of cases with 
onset late in life (Saunders et al., 1993; Strittmatter et al., 
1993). Similarly, a very small proportion of familial cases with 
onset before age 65 years have been associated with mutations in 
the ^-amyloid precursor protein (APP) gene (CSiartier-Harlin et 
15 al., 1991; Goate et al., 1991; Murrell et al . . 1991; Karlinslcy et 
al.. 1992; Mullan et al.. 1992). A third locus (AD3) associated 
with a larger proportion of cases with early onset AD has 
recently been mapped to chromosome 14q24.3 (Schellenberg et al 
1992; St. George-Hyslop et al., 1992; Van Broeckhoven et al. 
20 1992). 

Although the chromosome I4q region carries several genes 
which could be regarded as candidate genes for the site of 
mutations associated with AD3 (e.g.. cPOS, alpha-i- 
antichymotrypsin, and cathepsin G) . most of these candidate genes 
have been excluded on the basis of their physical location 
outside the AD3 region and/or the absence of mutations in their 
respective open reading frames (Schellenberg et al.. 1992- Van 
Broeckhoven et al.. 1992; Rogaev et al.. 1993; Wong et al'.. 



25 



30 



1993) 



There have been several developments and commercial 
directions or strategies in respect of treatment of Alzheimer's 
Disease and diagnosis thereof. Published PCT application WO 94 
23049 describes transfection of high molecular weight YAC DNA 
into specific mouse cells. This method may be used to analyze 
35 large gene complexes. For example, the transgenic mice may have 
increased APP gene dosage, which mimics the trisomic condition 
that prevails in Down's Syndrome, and allows the generation of 
animal models with ^-amyloidosis similar to .that prtevalent in 
individuals with Alzheimer's Disease. Published international 
40 application wo 94 00569 describes transgenic non-human animals 
harbouring large transgenes such as the transgene comprising a 
human APP gene. Such animal models can provide useful models of 
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human genetic diseases such as Alzheimer's Disease. 

Canadian Patent application No. 2096911 describes a nucleic 
acid coding for an APP-cleaving protease, which is associated 
with Alzheimer's Disease and Down's syndrome. The genetic 
5 information, which was isolated from chromosome 19, may be used 
to diagnose Alzheimer's Disease. Canadian Patent application 
2071105, describes detection and treatment of inherited or 
acquired Alzheimer's Disease by the use of YAC nucleotide 
sequences. The YACs are identified by the nimibers 23CB10, 28CA12 

10 and 26FF3. 

U.S. Patent 5,297,562, describes detection of Alzheimer's 
Disease associated with trisomy of chromosome 21. Treatment 
involves methods for reducing the proliferation of chromosome 21 
trisomy. Canadian Patent application No, 2054302 describes 

15 monoclonal antibodies which recognize a human brain cell nucleus 
protein encoded by chromosome 21 and are used to detect changes 
of expression due to Alzheimer's Disease or Down's Syndrome. 
The monoclonal antibody is specific to a protein encoded by human 
chromosome 21 and is found in large pyramidal cells of human 

20 brain tissue. 

Summary of the Invention 

The present invention is based, in part, upon the 
identification, isolation, cloning and sequencing of two 
mammalian genes which have been designated presenilin-1 (PSl) and 

25 presenilin-2 (PS2) . These two genes, and their corresponding 
protein products, are members of a highly conserved family of 
genes, the presenilins, with homologues or orthologues in other 
mammalian species (e.g., mice, rats) as well as orthologues in 
invertebrate species (e.g., C. ?leq^^?, D. melanoaaster^ . 

3 0 Mutations in these genes have been linked to the development in 
humans of forms of Familial Alzheimer's Disease and may be 
causative of other disorders as well (e.g., other cognitive, 
intellectual, neurological or psychological disorders such as 
cerebral hemorrhage, schizophrenia, depression, mental 

35 retardation and epilepsy) . The present disclosure provides 

genomic and cDNA nucleotide sequences for human psi (hPSl) and 
humcui PS2 (hPS2) genes, a murine PSl homologue (mPSl) , and 
related genes from C. eleaans (sel-12, SPE-4) and D. melanoaaster 
(DmPS) . The disclosure also provides the predicted amino acid 

40 sequences of the presenil in proteins encoded by these genes and a 
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Structural characterization of the presenilins. including 
putative functional domains and antigenic determinants. A number 
of mutations in the presenilins which are causative of 
Alzheimer's Disease (AD) in humans are also disclosed and related 
5 to the fiinctional domains of the proteins. 

Thus, in one series of embodiments, the present invention 
provides isolated nucleic acids including nucleotide sequences 
comprising or derived from the presenilin genes and/or encoding 
polypeptides comprising or derived from the presenilin proteins. 
10 The presenilin sequences of the invention include the 

specifically disclosed sequences, splice variants of these 
sequences, allelic variants of these sequences, synonymous 
sequences, and homologous or orthologous variants of these 
sequences. Thus, for example, the invention provides genomic and 
15 CDNA sequences from the hPSl gene, the hPS2 gene, the mPSl gene, 
and the DmPS gene. The present invention also provides allelic' 
variants and homologous or orthologous sequences by providing 
methods by which such variants may be routinely obtained. The 
present invention also specifically provides for mutant or 
20 disease -causing variants of the presenilins by disclosing a 

number of specific mutant sequences and by providing methods by 
which other such variants may be routinely obtained. Because the 
nucleic acids of the invention may be used in a variety of 
diagnostic, therapeutic and recombinant applications, various 
25 subsets of the presenilin sequences and combinations of the 
presenilin sequences with heterologous sequences are also 
provided. For example, for use in allele specific hybridization 
screening or PGR amplification techniques, subsets of the 
presenilin sequences, including both sense and antisense 
30 sequences, and both normal and mutant sequences, as well as 

intronic, exonic and untranslated sequences, are provided. Such 
sequences may comprise a small number of consecutive nucleotides 
from the sequences which are disclosed or otherwise enabled 
herein but preferably include at least 8-10, and more preferably 
35 9-25, consecutive nucleotides from a presenilin sequence, other 
preferred subsets of the presenilin sequences include those, 
encoding one or more of the functional domains or antigenic 
determinants of the presenilin proteins and, in particular, may 
include either normal (wild-type) or mutant sequences. The 
40 invention also provides for various nucleic acid constructs in 
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Which presenilin sequences, either complete or subsets, are 
operably joined to exogenous sequences to form cloning vectors 
expression vectors, fusion vectors, transgenic constructs, and' 
the like. Thus, in accordance with another aspect of the 
5 invention, a recombinant vector for transforming a mammalian or 
invertebrate tissue cell to express a normal or mutant presenilin 
sequence in the cells is provided. 

In another series of embodiments, the present invention 
provides for host cells which have been transfected or otherwise 
10 transformed with one of the nucleic acids of the invention The 
cells may be transformed merely for purposes of propagating the 
nucleic acid constructs of the invention, or may be transformed 
so as to express the presenilin sequences. The transformed cells 
of the invention may be used in assays to identify proteins 
15 and/or other compounds which affect normal or mutant presenilin 
expression, which interact with the normal or mutant presenilin 
proteins, and/or which modulate the function or effects of the 
normal or mutant proteins, or to produce the presenilin proteins, 
fusion proteins, functional domains, antigenic determinants, 
20 and/or antibodies of the invention. Transformed cells may also 
be implanted into hosts, including humans, for therapeutic or 
other reasons. Preferred host cells include mammalian cells from 
neuronal, fibroblast, bone marrow, spleen, organotypic or mixed 
cell cultures, as well as bacterial, yeast, nematode, insect and 
25 other invertebrate cells. For uses as described below, preferred 
cells also include embryonic stem cells, zygotes, gametes, and 
germ line cells. 

In another series of embodiments, the present invention 
provides transgenic animal models for AD and other diseases or 
iO disorders associated with mutations in the presenilin genes. The 
animal may be essentially any mammal, including rats, mice 
hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pige. 
and non-human primates, m addition, invertebrate models, 
including nematodes and insects, may be used for certain 
applications. The animal models are produced by standard 
twsgenic methods. including microinjection, transfection. or 
other forms of transformation of embryonic stem cells, zygotes 
gametes, and germ line cells with vectors including genomic or' 
CDNA fragments, minigenes. homologous recombination vectors, 
vxral insertion vectors and the like. Suitable vectors include 



35 
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vaccinia virus, adenovirus, adeno associated virus, retrovirus, 
liposome transport, neuraltropic viruses, cuid Herpes simplex 
virus. The animal models may include transgenic sequences 
comprising or derived from the presenilins, including normal and 
5 mutauit sequences, intronic, exonic and tintranslated sequences, 
auid sequences encoding subsets of the presenilins such as 
functional domains. The major types of animal models provided 
include: (1) Animals in which a normal hiiman presenilin gene 
has been recombinantly introduced into the genome of the animal 

10 as an additional gene, under the regulation of either an 

exogenous or an endogenous promoter element, and as either a 
minigene or a large genomic fragment; in which a normal hximan 
presenilin gene has been recombinantly substituted for one or 
both copies of the animal's homologous presenilin gene by 

15 homologous recombination or gene targeting; auid/or in which one 
or both copies of one of the animal's homologous presenilin genes 
have been recombinantly ''humanized" by the partial substitution 
of sequences encoding the human homologue by homologous 
recombination or gene targeting . (2) Animals in which a mutant 

20 human presenilin gene has been recombinantly introduced into the 
genome of the animal as an additional gene, under the regulation 
of either an exogenous or an endogenous promoter element, and as 
either a minigene or a large genomic fragment; in which a mutant 
human presenilin gene has been recombinauitly substituted for one 

25 or both copies of the animal's homologous presenilin gene by 

homologous recombination or gene targeting; and/or in which one 
or both copies of one of the animal's homologous presenilin genes 
have been recombinantly "hximauiized" by the partial substitution 
of sequences encoding a mutant human homologue by homologous 

30 recombination or gene targeting. (3) Animals in which a mutant 
version of one of that animal's presenilin genes has been 
recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or 
an endogenous promoter element, and as either a minigene or a 

35 large genomic fragment; and/or in which a mutant version of one 
of that animal's presenilin genes has been recombinantly 
substituted for one or both copies of the animal's homologous 
presenilin gene by homologous recombination or gene targeting. 
(4) "Knock-out" animals in which one or both copies of one of 

40 the animal's presenilin genes have been partially or completely 
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deleted by homologous recombination or gene targeting, or have 
been inactivated by the insertion or substitution by homologous 
recombination or gene targeting of exogenous sequences, in 
preferred embodiments, a transgenic mouse model for AD has a 
5 transgene encoding a normal human PSl or PS2 protein, a mutant 
human or murine PSl or PS2 protein, or a humanized normal or 
mutant murine PSl or PS2 protein. 

In another series of embodiments, the present invention 
provides for substantially pure protein preparations including 
10 polypeptides comprising or derived from the presenilins proteins. 
The presenilin protein sequences of the invention include the 
specifically disclosed sequences, variants of these sequences 
resulting from alternative mRNA splicing, allelic variants of 
these sequences, and homologous or orthologous variants of these 
15 sequences. Thus, for example, the invention provides amino acid 
sequences from the hPSl protein, the hPS2 protein, the mPSl 
protein, and the DmPS protein. The present invention also 
provides allelic variants and homologous or orthologous proteins 
by providing methods by which such variants may be routinely 
20 obtained. The present invention also specifically provides for 
mutant or disease -causing variants of the presenilins by 
disclosing a number of specific mutant sequences and by providing 
methods by which other such variants may be routinely obtained. 
Because the proteins of the invention may be used in a variety of 
25 diagnostic, therapeutic and recombinant applications, various 
subsets of the presenilin protein sequences and combinations of 
the presenilin protein sequences with heterologous sequences are 
also provided. For example, for use as iramunogens or in binding 
assays, subsets of the presenilin protein sequences, including 
30 both normal cuid mutant sequences, are provided. Such protein 
sequences may con^rise a small number of consecutive amino acid 
residues from the sequences which are disclosed or otherwise 
enabled herein but preferably include at least 4-8, and 
preferably at least 9-15 consecutive amino acid residues from a 
35 presenilin sequence. Other preferred subsets of the presenilin 
piTptein sequences include those, corresponding to pne or more of 
the fxinctional domains or antigenic determinants of the 
presenilin proteins and, in particular, may include either normal 
(wild- type) or mutant sequencies. The invention also provides for 
40 various protein constructs in which presenilin sequences, either 
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complete or sxibsets, are joined to exogenous sequences to form 
fusion proteins and the like, in accordance with these 
embodiments, the present invention also provides for methods of 
producing all of the above described proteins which comprise, or 
5 are derived from, the presenilins. 

In another series of embodiments, the present invention 
provides for the production and use of polyclonal and monoclonal 
antibodies, including cuitibody fragments, including Fab 
fragments, FCabMj, and single chain antibody fragments, which 

10 selectively bind to the presenilins, or to specific antigenic 

determinants of the presenilins. The antibodies may be raised in 
mouse, rabbit, goat or other suitable animals, or may be produced 
recombinantly in cultured cells such as hybridoma cell lines. 
Preferably, the antibodies are raised again presenilin sequences 

15 comprising at least 4-8, and preferably at least 9-15 consecutive 
amino acid residues from a presenilin sequence. The antibodies 
of the invention may be used in the various diagnostic, 
therapeutic and technical applications described herein. 

In another series of embodiments , the present invention 

20 provides methods of screening or identifying proteins, small 
molecules or other compounds which are capable of inducing or 
inhibiting the expression of the presenilin genes and proteins 
(e.g., PSl or PS2) . The assays may be performed in vitro using 
non- trans formed cells, immortalized cell lines, or recombinant 

25 cell lines, or j.n vAyo using the transgenic animal models enabled 
herein. In particular, the assays may detect the presence of 
increased or decreased expression of PSl, PS2 or other 
presenilin- related genes or proteins on the basis of increased or 
decreased mRNA expression, increased or decreased levels of 

3 0 presenilin-related protein products, or increased or decreased 
levels of expression of a marker gene (e.g., iS-galactosidase, 
green fluorescent protein, alkaline phosphatase or luciferase) 
operably joined to a presenilin 5' regulatory region in a 
recombinant construct. Cells known to express a particular 

35 presenilin, or transformed to express a particular presenilin, 
^re incubated and one or more test confounds arie added, to the 
medium. After allowing a sufficient period of time (e.g., 0-72 
hours) for the compound to induce or inhibit the expression of 
the presenilin, any change in levels of expression from an 

40 established baseline may be detected using any of the techniques 
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described above. In particularly preferred embodiments, the 
cells are from an immortalized cell line such as a human 
neuroblastoma, glioblastoma or a hybridoraa cell line, or are 
transformed cells of the invention. 

In another series of embodiments, the present invention 
provides methods for identifying proteins and other compounds 
which bind to, or otherwise directly interact with, the 
presenilins. The proteins and confounds will include endogenous 
cellular components which interact with the presenilins in vivo 
and which, therefore, provide new targets for pharmaceutical and 
therapeutic interventions, as well as recombinant, synthetic and 
otherwise exogenous compounds which may have presenilin binding 
capacity and, therefore, may be candidates for pharmaceutical 
agents. Thus, in one series of embodiments, cell lysates or 
tissue homogenates (e.g., human brain homogenates, lymphocyte 
lysates) may be screened for proteins or other compounds which 
bind to one of the normal or mutant presenilins. Alternatively, 
any of a variety of exogenous compoxinds, both naturally occurring 
and/ or synthetic (e.g., libraries of small molecules or 
peptides) , may be screened for presenilin binding capacity. In 
each of these embodiments, an assay is conducted to detect 
binding between a "presenilin component" and some other moiety. 
The "presenilin component" in these assays may be any polypeptide 
comprising or derived from a normal or mutant presenilin protein, 
including functional domains or antigenic determinants of the 
presenilins, or presenilin fusion proteins. Binding may be 
detected by non-specific measures (e.g., changes in intracellular 
Ca'*, GTP/GDP ratio) or by specific measures (e.g., changes in A/8 
peptide production or changes in the expression of other 
downstream genes which can be monitored by differential display, 
2D gel electrophoresis, differential hybridization, or SAGE 
methods) . The preferred methods involve variations on the 
following techniques: (i) direct extraction by affinity 
chromatography; (2) co- isolation of presenilin components and 
bound proteins or other compounds by iromunoprecipitation; (3) 
the Biomolecular interaction Assay (BIAcore) ; and (4) the yeast, 
two-hj^rid systems . 

In another series of embodiments, the present invention 
provides for methods of identifying proteins, small molecules and 
other compounds capable of modulating the activity of normal or 
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mutant presenilins. Using normal cells or animals, the 
transformed cells and transgenic animal models of the present 
invention, or cells obtained from subjects bearing normal or 
mutant presenilin genes, the present invention provides methods 
5 of identifying such compounds on the basis of their ability to 
affect the expression of the presenilins, the intracellular 
localization of the presenilins, intracellular Ca^*, Na*, K* or 
other ion levels or metabolism, the occurrence or rate of 
apoptosis or cell death, the levels or pattern of A/S peptide 
10 production, the presence or levels of phosphorylation of 
microtubule associated proteins, or other biochemical, 
histological, or physiological markers which distinguish cells 
bearing normal and mutant presenilin sequences. Using the 
transgenic animals of the invention, methods of identifying such 
compounds are also provided on the basis of the ability of the 
compounds to affect behavioral, physiological or histological 
phenotypes associated with mutations in the presenilins. 

In another series of embodiments, the present invention 
provides methods for screening for carriers of presenilin alleles 
associated with AD, for diagnosis of victims of AD, and for the 
screening and diagnosis of related presenile and senile 
dementias, psychiatric diseases such as schizophrenia and 
depression, and neurologic diseases such as stroke and cerebral 
hemorrhage, which associated with mutations in the PSl or PS2 
25 genes. Screening and/or diagnosis can be accomplished by methods 
based upon the nucleic acids (including genomic and mRNA/cDNA 
sequences), proteins, and/or antibodies disclosed and enabled 
herein, including functional assays designed to detect failure or 
augmentation of the normal presenilin activity and/or the 
30 presence of specific new activities conferred by the mutant 

presenilins. Thus, screens and diagnostics based upon presenilin 
proteins are provided which detect differences between mutant and 
normal presenilins in elect rophore tic mobility, in proteolytic 
cleavage patterns, in molar ratios of the various amino acid 
35 residues, in ability to bind specific antibodies. In addition, 
screens and diagnostics based upon nucleic acids (gDNA, cDNA or 
mRNA) are provided which detect differences in nucleotide 
sequences by direct nucleotide sequencing, hybridization using 
allele specific oligonucleotides, restriction enzyme digest and 
40 mapping (e.g., RPLP. REF-SSCP), electrophoretic mobility (e.g.. 
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SSCP, DGGE) , PCR mapping, RNase protection, chemical mismatch 
cleavage, ligase-mediated detection, and various other methods. 
Other methods are also provided which detect abnormal processing 
of PSi, PS2, APP, or proteins reacting with PSl, PS2, or APP 
5 (e.g., abnormal phosphorylation, glycosylation, glycation 
amidation or proteolytic cleavage) alterations in presenilin 
transcription, translation, and post-translational modification; 
alterations in the intracellular and extracellular trafficking of 
presenilin gene products; or abnormal intracellular localization 
10 of the presenilins. In accordance with these embodiments, 

diagnostic kits are also provided which will include the reagents 
necessary for the above-described diagnostic screens. 

In another series of embodiments, the present invention 
provides methods and pharmaceutical preparations for use in the 
15 treatment of presenilin-associated diseases such as AD. These 
methods and pharmaceuticals are be based upon (l) administration 
of normal PSl or PS2 proteins, (2) gene therapy with normal PSl 
or PS2 genes to compensate for or replace the mutant genes, (3) 
gene therapy based upon antisense sequences to mutant PSl or PS2 
genes or which "knock-out" the mutant genes, (4) gene therapy 
based upon sequences which encode a protein which blocks or 
corrects the deleterious effects of PSl or PS2 mutants, (5) 
immunotherapy based upon antibodies to normal and/or mutant PSl 
or PS2 proteins, or (6) small molecules (drugs) which alter PSl 
25 or PS2 expression, block abnormal interactions between mutant 
forms of PSl or PS2 and other proteins or ligands, or which 
otherwise block the aberrant function of mutant PSl or PS2 
proteins by altering the structure of the mutant proteins, by 
enhancing their metabolic clearance, or by inhibiting their 
30 fumction. 

In accordance with another aspect of the invention, the 
proteins of the invention can be used as starting points for 
rational drug design to provide ligands, therapeutic drugs or 
other types of small chemical molecules. Alternatively, small 
35 molecules or other compouhds identified by the above -described 
screening assays may serve as "lead compounds* in rational drug 
design. 

Particularly disclosed nucleotide and amino acid sequences 
of the present invention are numbered SEQ ID NOs: 1-25. In 
addition, under the terms of the Budapest Treaty, biological 
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deposits of particular nucleic acids disclosed herein have made 
with the ATCC (Rockville, MD) . These deposits include Accession 
Number 97124 (deposited April 28, 1995), Accession Number 97508 
(deposited on April 28, 1995), Accession Number 97214 (deposited 
5 on June 28, 1995), and Accession Number 97428 (deposited January 
26, 1996). 

pyief Description of the Drawinos 
Figure 1: This figure is a representation of the structural 
organization of the hPSl genomic DNA. Non-coding exons are 

10 depicted by solid shaded boxes. Coding exons are depicted by 

open boxes or hatched boxes for alternatively spliced sequences. 
Restriction sites are: B . BamHI; E - EcoRI; H = Hindlll; N « 
Not I; P = PstI; V = PvuII; X = Xbal . Discontinuities in the 
horizontal line between restriction sites represent undefined 

15 genomic sequences. Cloned genomic fragments containing each exon 
are depicted by double-ended horizontal arrows. The size of the 
genomic subclones and Accession number for each genomic sequence 
are provided. 

Figure 2: This figure is a representation of a hydropathy 

20 plot of the putative PSl protein. The plot was calculated 
according to the method of Kyte and Doolittle (1982) . 

Figure 3: This figure presents a sequence alignment of the 
hPSl and mPSl protein sequences. Vertical bars indicate 
identical amino acids. 

25 Figure 4: This figure presents a sequence alignment of the 

hPSl and hPS2 protein sequences. Vertical bars indicate 
identical amino acids. 

Figure 5: This figure is a schematic drawing of the 
predicted structure of the PSl protein. Roman numerals depict 

30 the transmembrsuie domains. Putative glycosylation sites are 

indicated as asterisks and most of the phosphorylation sites are 
located on the same membrane face as the two acidic hydrophilic 
loops. The MAP kinase site is present at residue 115 and the PKC 
site at residue 114. FAD mutation sites are indicated by 

35 horizontal arrows. 

Figure 6 ? This figure is a schematic drawing of the 
predicted structure of the PS2 protein. Roman numerals depict the 
transmembrane domains. Putative glycosylation sites are 
indicated as asterisks and most of the phosphorylation sites are 

40 located on the same membrane face as the two acidic hydrophilic 
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loops. FAD mutation sites are indicated by horizontal arrows. 

Petailed PeBcrlpl^inn »f t -.he ln^rf,r^f ^ r.r, 
I- Definitions 

in order to facilitate review of the various embodiments of 
the invention, and an understanding of the various elements and 
constituents used in making and using the invention, the 
following definitions are provided for particular terms used in 
the description and appended claims: 

Pr^senilip. As used without further modification herein, the 
terms 6presenilin6 or dpresenilinsS mean the presenilin-i (PSl) 
and/or the presenilin-2 <PS2) genes/proteins, m particular, the 
unmodified terms 6presenilin6 or 6presenilins6 refer to the 
mammalian PSl and/or PS2 genes /proteins and, preferably, the 
human PSl and/or PS2 genes/proteins. 

PrgggnA I in-l q^'np , as used herein, the term "presenilin-i gene" 
or "PSl gene" means the mammalian gene first disclosed and 
described in U.S. Application Ser. No. 08/431,048, filed on April 
28, 1995, and later described in Sherrington et al. (1995) 
including any allelic variants and heterospecif ic mammalia^ 
homologues. One human presenilin-i (hPSl) cDNA sequence is 
disclosed herein as SEQ id NO: 1. Another human cDNA sequence, 
resulting from alternative splicing of the hPSl mRNA transcript 
IS disclosed as SEQ id NO: 3. Additional human splice variants' 
as described below, have also been found in which a region 
encoding thirty-three residues may be spliced-out in some 
transcripts. A cDNA of the murine homologue (mPSl) is disclosed 
as SEQ ID NO: 16. The term -presenilin-1 gene" or -PSi gene" 
primarily relates to a coding sequence, but can also include some 
or all of the flanking regulatory regions and/or introns. The 
term PSl gene specifically includes artificial or recombinant 
genes created from cDNA or genomic DNA, including recombinant 
genes based upon splice variants. The presenilin-l gene has also 
been referred to as the S182 gene (e.g., Sherrington et al 
1995) or as the Alzheimer's Related Membrane Protein (ARMP) gene 
(e.g., U.S. Application Ser. No. 08/431,048, filed on April 28 
1995) . f ^o. 

PygggpiliTi-i prnrpin , AS used herein, the term "presenilin-l 
protein" or -PSl protein- means a protein encoded by a PSl gene 
including allelic variants and heterospecif ic mammalian 
homologues, Qne human presenilin-i (hPSl). protein sequence is 
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disclosed herein as SEQ ID NO: 2. Another human PSl protein 
sequence, resulting from alternative splicing of the hPSl mRNA 
transcript, is disclosed as SEQ ID NO: 4. Additional human 
splice variants, as described below, have also been found in 
5 which a region including thirty-three residues may be spliced-out 
in some transcripts • These variants are also embraced by the 
term presenilin-l protein as used herein, A protein sequence of 
the murine homologue (mPSl) is disclosed as SEQ ID NO: 17. The 
protein may be produced by recombinsmt cells or organisms, may be 

10 substantially purified from natural tissues or cell lines, or may 
be synthesized chemically or enzymatically . Therefore, the term 
"presenilin-l protein" or "PSl protein" is intended to include 
the protein in glycosylated, partially glycosylated, or 
unglycosylated forms, as well as in phosphorylated, partially 

15 phosphorylated, unphosphorylated, sulphated, partially sulphated, 
or unsulphated forms. The term also includes allelic variants 
and other functional equivalents of the PSl amino acid sequence, 
including biologically active proteolytic or other fragments. 
This protein has also been referred to as the S182 protein (e.g., 

20 Sherrington et al., 1995) or as the Alzheimer's Related Membrane 
Protein (ARMP) (e.g., U,S. Application Ser. No. 08/431,048, filed 
on April 28, 1995) . 

hPSl gene and/or protein. As used herein, the abbreviation 
"hPSl" refers to the human homologue and human allelic variants 

25 of the PSl gene and/or protein. Two cDNA sequences of the human 
PSl gene are disclosed herein as SEQ ID NO: 1 and SEQ ID NO: 3. 
The corresponding hPSl protein sequences are disclosed herein as 
SEQ ID NO: 2 and SEQ ID NO: 4. Numerous allelic variants, 
including deleterious mutants, are disclosed and enabled 

30 throughout the description which follows. 

mPSl gene and/or protein. As used herein, the abbreviation 
"mPSl" refers to the murine homologues and murine allelic 
variants of the PSl gene and/or protein. A cDNA sequence of one 
murine PSl gene is disclosed herein as SEQ ID NO: 16. The 

3 5 corresponding mPSl protein sequence is disclosed herein as SEQ ID 
NO: 17, Allelic variEuits, including .deleteripus mutants, are 
enabled in the description which follows. 

Presenilin-2 aene. As used herein, the term "presenilin-2 gene" 
or "PS2 gene" means the mammalian gene first disclosed and 
40 described in U.S. Application Ser. No. 08/496,841, filed on June 
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28, 1995, and later described in Rogaev et al. (1995) and Levy- 
Lahad et al. (1995), including any allelic variants and 
heterospecific manunalian homologues. One human presenilin-2 
(hPS2) cDNA sequence is disclosed herein as SEQ ID NO: 18 . 
Additional human splice variants, as described below, have also 
been found in which a single codon or a region encoding thirty- 
three residues may be spliced-out in some transcripts. The term 
"presenilin-2 gene" or "PS2 gene" primarily relates to a coding 
sequence, but can also include some or all of the flanking 
regulatory regions and/or introns. The term PS2 gene 
specifically includes artificial or recombinant genes created 
from CDNA or genomic DNA, including recombinant genes based upon 
splice variants. The presenilin-2 gene has also been referred to 
as the E5-1 gene (e.g., Rogaev et al., 1995; U.S. Application 
Ser. No. 08/496,841. filed on June 28, 1995) or the STM2 gene 
(e.g., Levy-Lahad et al . , 1995). 
Presenilin-2 prote;^!; , As used herein, the term •'presenilin-2 
protein" or "PS2 protein" means a protein encoded by a PS2 gene, 
including allelic variants and heterospecific mammalian 
20 homologues. One human presenilin-2 (hPS2) protein sequence is 
disclosed herein as SEQ ID NO: 19. Additional human splice 
variants, as described below, have also been found in which a 
single residue or a region including thirty-three residues may be 
spliced-out in some transcripts. These variants are also 
embraced by the term presenilin-2 protein as used herein. The 
protein may be produced by recombinant cells or organisms, may be 
substantially purified from natural tissues or cell lines, or may 
be synthesized chemically or enzymatically . Therefore, the term 
"presenilin-2 protein" or "PS2 protein" is intended to include 
the protein in glycosylated, partially glycosylated, or 
unglycosylated forms, as well as in phosphorylated, partially 
phosphorylated, unphosphorylated, sulphated. partially sulphated. 
or unsulphated forms. The term also includes allelic variants 
and other functional equivalents of the PS2 amino acid sequence, 
including biologically active proteolytic or other fragments. 
, This protein has also , been referred to as the E5-1 protein (e g 
Sherrington et al., 1995; U.S. Application Ser. No. 08/496,841 
filed on June 28, 1995) or the STM2 protein (e.g.. Levy-Lahad et 
al. , 1995) . 

*° hPgj^ .qfflg nng/Pr Pmftin , as used herein, the abbreviation 
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»hPS2" refers to the human homologue and human allelic variants 
of the PS2 gene and/or protein. One cDNA sequences of the human 
PS2 gene is disclosed herein as SEQ ID NO: 18. The corresponding 
hPS2 protein sequence is disclosed herein as SEQ ID NO: 19. 
Numerous allelic variants, including deleterious mutants, are 
disclosed and enabled throughout the description which follows. 
pinPS q^ff? apd/pr prgtftiP, As used herein, the abbreviation 
"DmPS" refers to the pyp^gpfrj^^^ homologues and allelic variants 
of the PSl and PS2 genes/proteins. This definition is understood 
to include nucleic acid and amino acid sequence polymorphisms 
wherein substitutions, insertions or deletions in the gene or 
protein sequence do not affect the essential function of the gene 
product. The nucleotide sequence of one cDNA of the DmPS gene is 
disclosed herein as SEQ ID NO: 20 and the corresponding amino 
acid sequence is disclosed as SEQ ID NO: 21. The term "DmPS 
gene" primarily relates to a coding sequence but can also include 
some or all of the flanking regulatory regions and/or introns. 
IJCESal^ AS used herein with respect to genes, the term dnormald 
refers to a gene which encodes a normal protein. As used herein 
with respect to proteins, the term 6normal6 means a protein 
which performs its usual or normal physiological role and which 
is not associated with, or causative of, a pathogenic condition 
or state. Therefore, as used herein, the term 6normal6 is 
essentially synonymous with the usual meaning of the phrase dwild 
type. 6 For any given gene, or corresponding protein, a 
multiplicity of normal allelic variants may exist, none of which 
is associated with the development of a pathogenic condition or 
state, such normal allelic variants include, but are not limited 
to, variants in which one or more nucleotide substitutions do not 
result in a change in the encoded amino acid sequence. 
Hlltant^ As used herein with respect to genes, the term 6mutant6 
refers to a gene which encodes a mutant protein. As used herein 
with respect to proteins, the term 6mutant6 means a protein which 
does not perform its usual or normal physiological role and which 
IS associated with, or causative of, a pathogenic condition or 
state. Therefore, as used herein, the term 6mutant6 is 

essentially synonymous with the terms ^dysfunctional, 6 
dpathogenic,6 ddisease-causing.a and 6deleteriou8.6 With respect 
to the presenilin genes and proteins of the present invention, 
the term 6mutant6 refers to presenilin genes /proteins bearing one 
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or more nucleotide/amino acid substitutions, insertions and/or 
deletions which typically lead to the development of the symptoms 
of Alzheimer's Disease and/or other relevant inheritable 
phenotypes (e.g. cerebral hemorrhage, mental retardation, 
schizophrenia, psychosis, and depression) when expressed in 
humans. This definition is understood to include the various 
mutations that naturally exist, including but not limited to 
those disclosed herein, as well as synthetic or recombinant 
mutations produced by human intervention. The term "mutant," as 
applied to the presenilin genes, is not intended to embrace 
sequence variants which, due to the degeneracy of the genetic 
code, encode proteins identical to the normal sequences disclosed 
or otherwise enabled herein; nor is it intended to embrace 
sequence variants which, although they encode different proteins, 
encode proteins which are functionally equivalent to normal 
presenilin proteins. 

F\m<?t;:ion?l equivalent, As used herein in describing gene 
sequences and amino acid sequences, the term "functional 
equivalent" means that a recited sequence need not be identical 
to a particularly disclosed sequence of the SEQ ID NOs but need 
only provide a sequence which functions biologically and/or 
chemically as the equivalent of the disclosed sequence. 
Stfggt^y^tiallY gvrg, As used herein with respect to proteins 
(including antibodies) or other preparations, the term 
"substantially pure" means a preparation which is at least 60% by 
weight (dry weight) the compound of interest. Preferably the 
preparation is at least 75%, more preferably at least 90%, and 
most preferably at least 99%, by weight the compound of interest. 
Purity can be measured by any appropriate method, e.g., column 
chromatography, gel electrophoresis, or HPLC analysis. 

With respect to proteins, including antibodies, if a 
preparation includes two or more different compounds of interest 
(e.g., two or more different antibodies, immunogens, functional 
domains, or other polypeptides of the invention), a 
"substantially pure" preparation means a preparation in which the 
total weight (dry weight) o,f all tbe compounds of interest is at 
least 60% of the total dry weight. Similarly, for such 
preparations containing two or more compounds of interest, it is 
preferred that the total weight of the compounds of interest be 
at least 75%, ^ore preferably at least 90%, and most preferably 
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at least 99%, of the total dry weight of the preparation. 
Isolated nucleic acid. As used herein, an "isolated nucleic 
acid" is a ribonucleic acid, deoxyribonucleic acid, or nucleic 
acid analog comprising a polynucleotide sequence that has been 
5 isolated or separated from sec[uences that are immediately 
contiguous {one on the 5' end and one on the 3' end) in the 
naturally occurring genome of the organism from which it is 
derived. The term therefore includes, for example, a recombinant 
nucleic acid which is incorporated into a vector, into an 

10 autonomously replicating plasmid or virus, or into the genomic 
DNA of a prokaryote or eukaryote; or which exists as a separate 
molecule (e*g., a cDNA or a genomic DNA fragment produced by PGR 
or restriction endonuclease treatment) independent of other 
sequences. It also includes a recombinant DNA which is part of a 

15 hybrid gene encoding additional polypeptide sequences and/or 
including exogenous regulatory elements. 
Substantially identical sequence. As used herein, a 
"siibstantially identical" amino acid sequence is an amino acid 
sequence which differs only by conservative amino acid 

20 substitutions, for example, substitution of one amino acid for 

another of the same class (e.g., valine for glycine, arginine for 
lysine, etc.) or by one or more non- conservative substitutions, 
deletions, or insertions located at positions of the amino acid 
sequence which do not destroy the function of the protein 

25 (assayed, e.g., as described herein). Preferably, such a 
sequence is at least 85%, more preferably 90%, and most 
preferably 95% identical at the amino acid level to the sequence 
of the protein or peptide to which it is being compared. For 
nucleic acids, the length of comparison sequences will generally 

30 be at least 50 nucleotides, preferably at least 60 nucleotides, 
more preferably at least 75 nucleotides, and most preferably 110 
nucleotides. A "substantially identical" nucleic acid sequence 
codes for a substantially identical amino acid sec[uence as 
defined above. 

35 Transformed cell. As used herein, a "treoisf ormed cell" is a cell 
into which (or into an ancestor of which) has been introduced, by 
meeuis of recombinant DNA techniques, a nucleic acid molecule of 
interest. The nucleic acid of interest will typically encode a 
peptide or protein. The transformed cell may express the 

40 sequence of interest or may be used only to propagate the 
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sequence. The term "transformed" may be used herein to embrace 
any method of introducing exogenous nucleic acids including, but 
not limited to, transformation, transfection, elect roporat ion, 
microinjection, viral -mediated tremsf ection, and the like. 
Qperablv -joined. As used herein, a coding sequence emd a 
regulatory region are said to be "operably joined" when they are 
covalently linked in such a way as to place the expression or 
transcription of the coding sequence \mder the influence or 
control of the regulatory region. If it is desired that the 
coding sequences be translated into a functional protein, two DNA 
sequences are said to be operably joined if induction of promoter 
function results in the transcription of the coding sequence and 
if the nature of the linkage between the two DNA sequences does 
not (1) result in the introduction of a frame-shift mutation, (2) 
interfere with the ability of the regulatory region to direct the 
transcription of the coding sequences, or (3) interfere with the 
ability of the corresponding RNA transcript to be translated into 
a protein. Thus, a regulatory region would be operably joined to 
a coding sequence if the regulatory region were capable of 
effecting transcription of that DNA sequence such that the 
resulting transcript might be translated into the desired protein 
or polypeptide. 

Stringent hybridization gonriitione. stringent hybridization 
conditions is a term of art understood by those of ordinary skill 
in the art. For any given nucleic acid sequence, stringent 
hybridization conditions are those conditions of temperature, 
chaotrophic acids, buffer, and ionic strength which will permit 
hybridization of that nucleic acid sequence to its complementary 
sequence and not to substantially different sequences. The exact 
conditions which constitute "stringent" conditions, depend upon 
the nature of the nucleic acid sequence, the length of the 
sequence, and the frequency of occurrence of subsets of that 
sequence within other non-identical sequences. By varying 
hybridization conditions from a level of stringency at which non- 
specific hybridization occurs to a level at which only specific 
hybridization, is observed, one of ordinary, skill in the a^rt can,, 
without undue experimentation, determine conditions which will 
allow a given sequence to hybridize only with complementary 
sequences. Suitable ranges of such stringency conditions are 
described in Krause and Aaronson (1991) . Hybridization 
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conditions, depending upon the length and commonality of a 
sequence, may include temperatures of 20«»C-65oc and ionic 
strengths from 5x to O.lx SSC. Highly stringent hybridization 
conditions may include temperatures as low as 40-42OC (when 
denaturants such as formamide are included) or up to 60-65«C in 
ionic strengths as low as O.lx SSC. These ranges, however, are 
only illustrative and, depending upon the nature of the target 
sequence, and possible future technological developments, may be 
more stringent than necessary. Less than stringent conditions 
are employed to isolate nucleic acid sequences which are 
substantially similar, allelic or homologous to any given 
sequence . 

gglggt^vglv binriB , As used herein with respect to antibodies, an 
antibody is said to "selectively bind" to a target if the 
antibody recognizes and binds the target of interest but does not 
substantially recognize and bind other molecules in a sample, 
e.g., a biological sample, which includes the target of interest. 
II- The Present;! -inp 

The present invention is based, in part, upon the discovery 
of a family of mammalian genes which, when mutated, are 
associated with the development of AlzheimerOs Disease. The 
discovery of these genes, designated presenilin-l and presenilin- 
2, as well as the characterization of these genes, their protein 
products, mutants, and possible functional roles, are described 
25 below. Invertebrate homologues of the presenilins are also 
discussed as they may shed light on the function of the 
presenilins and to the extent they may be useful in the various 
embodiments described below. 
^- Ig9l^^;i(?n of the Human Pre«An.- 1 iy,--, n.^j^ ^ 
30 A. genetic Mapping of ^ ^e AD3 ltf,n\^p 

The initial isolation and characterization of the PSl gene, 
then referred to as the AD3 gene or S182 gene, was described in' 
Sherrington et al (1995). After the initial regional mapping of 
the AD3 gene locus to 14q24.3 near the anonymous microsatellite 
markers D14S43 and D14S53 (Schellenberg et al., 1992; St.. George- 
Hyslop et al., 1992; Van Broeckhoven et .al.. 199.2), twenty one. 
pedigrees were used to segregate AD as a putative autosomal 
dominant trait (St. George-Hyslop et al., 1992) and to 
investigate the segregation of 18 additional genetic markers from 
the I4q24.3 region which had been organized into. a high density 
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genetic linkage map (Weissenbach et al., 1992; Gyapay et al., 
1994) . Previously published pairwise maximum likelihood analyses 
confirmed substantial cumulative evidence for linkage between 
familial Alzheimer's Disease (FAD) and all of these markers. 
5 However, much of the genetic data supporting linkage to these 
markers were derived from six large early onset pedigrees, PADl 
(Nee et al., 1983). FAD2 (Froramelt et al., 1991), FAD3 (Goudsmit 
et al., 1981; Pollen, 1993), FAD4 (Foncin et al., 1985), TORl.l 
(Bergamini, 1991) and 603 (Pericak- Vance et al., 1988), each of 
10 which provides at least one anonymous genetic marker from I4q24.3 
(St. George-Hyslop et al,, 1992). 

In order to define more precisely the location of the AD3 
gene relative to the known locations of the genetic markers from 
14q24.3, recombinational lemdmarks were sought by direct 

15 inspection of the raw haplotype data from those genotyped 

affected members of the six pedigrees showing definitive linkage 
to chromosome 14. This selective strategy in this particular 
instance necessarily discards data from the reconstructed 
genotypes of deceased affected members as well as from elderly 

20 asymptomatic members of the large pedigrees, and takes no account 
of the smaller pedigrees of uncertain linkage status. However, 
this strategy is very sound because it also avoids the 
acquisition of potentially misleading genotype data acquired 
either through errors in the reconstructed genotypes of deceased 

25 affected members arising from non-paternity or sampling errors or 
from the inclusion of unlinked pedigrees. 

Upon inspection of the haplotype data for affected subjects, 
members of the six large pedigrees whose genotypes were directly 
determined revealed obligate recombinants at D14S48 and D14S53, 

30 and at D14S258 and D14S63. The single recombinant at D14S53, 
which depicts a telomeric boundary for the FAD region, occurred 
in the same AD affected subject of the FADi pedigree who had 
previously been foxind to be recombinant at several other markers 
located telomeric to D14S53, including D14S48 (St. George-Hyslop 

35 etal. , 1992). Conversely, the single recombinant at D14S258, 

which marks a centromeric boundary of the FAD region, occurred in 
an affected member of the FAD3 pedigree who was also recombinant 
at several other markers centromeric to D14S258 including D14S63. 
Both recombinant subjects had unequivocal evidence of Alzheimer's 

40 Disease confirmed through standard clinical tests for the illness 
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in other affected members of their families, and the genotype of 
both recombinant subjects was informative and co-segregating at 
multiple loci within the interval centromeric to D14S53 and 
telomeric to D14S258. 
5 When the haplotype analyses were enlarged to include the 

reconstructed genotypes of deceased affected members of the six 
large pedigrees as well as data from the remaining fifteen 
pedigrees with probabilities for linkage of less than 0.95, 
several additional recombinants were detected at one or more 

10 marker loci within the interval between D14S53 and D14S258. 
Thus, one additional recombinant was detected in the 
reconstructed genotype of a deceased affected member of each of 
three of the larger FAD pedigrees (FADl, FAD2 and other related 
families), and eight additional recombinants were detected in 

15 affected members of five smaller FAD pedigrees. However, while 
some of these recombinants might have correctly placed the AD3 
gene within a more defined target region, it was necessary to 
regard these potentially closer "internal recombinants" as 
unreliable not only for the reasons discussed earlier, but also 

20 because they provided mutually inconsistent locations for the AD3 
gene within the D14S53-D14S258 interval. 

®- gonstr^gt^pn of a PhYBirnl Cont^cr fin^^^i ^ a th^ i>m ^ 0^^^^ 

As an initial step towards cloning the AD3 gene, a contig of 
overlapping genomic DNA fragments cloned into yeast artificial 

i5 chromosome vectors, phage artificial chromosome vectors and 
cosraid vectors was constructed. FISH mapping studies using 
cosmids derived from the YAC clones 932c7 and 964f5 suggested 
that the interval most likely to carry the AD3 gene was at least 
five megabases in size. Because the large size of this minimal 

0 co-segregating region would make positional cloning strategies 
intractable, additional genetic pointers were sought which 
focused the search for the AD3 gene to one or more subregions 
within the interval flanked by D14S53 and D14S258. Haplotype 
analyses at the markers between D14S53 and D14S258 failed to 

5 detect statistically significant evidence for linkage 

disequilibrium and/or allelic association between the FAD trait 
and alleles at any of these markers, irrespective of whether the 
analyses were restricted to those pedigrees with early onset 
forms of FAD, or were generalized to include all pedigrees. This 

0 result was not unexpected given the diverse ethnic origins of our 
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pedigrees. However, when pedigrees of similar ethnic descent 
were collated, direct inspection of the haplotypes observed on 
the disease-bearing chromosome segregating in different pedigrees 
of similar ethnic origin revealed two clusters of marker loci. 
The first of these clusters located centromeric to D14S77 
(D14S786, D14S277 and D14S268; and spanned the 0.95 Mb physical 
interval contained in YAC 78842. The second cluster was located 
telomeric to D14S77 (D14S43, D14S273, and D14S76) and spanned the 
- 1Mb physical interval included within the overlapping YAC 
clones 964C2, 74163, 797dll and part of 854f5. Identical alleles 
were observed in at least two pedigrees from the same ethnic 
origin. As part the strategy, it was reasoned that the presence 
of shared alleles at one of these groups of physically clustered 
marker loci might reflect the co- inheritance of a small physical 
15 region surrounding the PSl gene on the original founder 

chromosome in each ethnic population. Significantly, each of the 
shared extended haplotypes were rare in normal Caucasian 
populations and allele sharing was not observed at other groups 
of markers spanning similar genetic intervals elsewhere on 
20 chromosome I4q24.3. 

^- Tranggription Mapping and Analy g is of rarirtidate g^nA« 

To isolate expressed sequences encoded within both critical 
intervals, a direct selection strategy was used involving 
immobilized, cloned, human genomic DNA as the hybridization 
25 target to recover transcribed sequences from primary 

complementary DNA pools derived from human brain mRNA (Rommens et 
al., 1993). Approximately 900 putative cDNA fragments of size 
100 to 600 base pairs were recovered from these regions. These 
fragments were hybridized to Southern blots containing genomic 
30 DNAs from each of the overlapping YAC clones and genomic DNAs 
from humans and other mammals. This identified a subset of 151 
clones which showed evidence for evolutionary conservation and/or 
for a complex structure which suggested that they were derived 
from spliced mRNA. The clones within this subset were collated 
on the basis of physical, map location, cross-hybridization and 
nucleotide sequence, and were used to, screen conventional human 
brain cDNA libraries for longer cDNAs. At least 19 independent 
CDNA clones over l kb in length were isolated and then aligned 
into a partial transcription map of the AD3 region. Only three 
of these transcripts corresponded tp known characterized genes 
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(cFOS, dihydrolipoamide succinyl transferase, and latent 
transforming growth factor binding protein 2) . 
D. Recovery of Candidate Genes 

Each of the open reading frame portions of the csmdidate 
5 genes were recovered by RT-PCR from mRNA isolated from post- 
mortem brain tissue of normal control siibjects and from either 
post-mortem brain tissue or cultured fibroblast cell lines of 
affected members of six pedigrees definitively linked to 
chromosome 14. The RT-PCR products were then screened for 

10 sequence differences using chemical cleavage and restriction 

endonuclease fingerprinting single-strand sequence conformational 
polymorphism methods (Saleeba and Cotton, 1993; Liu and Sommer, 
1995), and by direct nucleotide sequencing. With one exception, 
all of the genes examined, although of interest, did not contain 

15 alterations in sequences that were unique to affected svibjects, 
or co-segregated with the disease. The single exception was the 
candidate gene represented by clone S182 which contained a series 
of nucleotide changes not observed in normal sxabjects, and which 
were predicted to alter the amino acid sequence in affected 

20 subjects. The gene corresponding to this clone has now been 
designated as presenilin-1 (PSl) . Two PSl cDNA sequences, 
representing alternative splice variants described below, are 
disclosed herein as SEQ ID NO: 1 and SEQ ID NO: 3. The 
corresponding predicted amino acid sequences are disclosed as SEQ 

25 ID NO: 2 and SEQ ID NO: 4, respectively. Bluescript plasmids 
bearing clones of these cDNAs have been deposited at the ATCC, 
Rockville, Md., under ATCC Accession Numbers 97124 and 97508 on 
April 28, 1995. Sequences corresponding to SEQ ID NO: 1 and SEQ 
ID NO: 2 have also been deposited in the GenBank database and may 

30 be retrieved through Accession # 42110. 

2. Isolation of the Murine Presenilin-1 Gene 

A murine homologue (mPSl) of the human PSl gene was 
recovered by screening a mouse cDNA library with a labelled human 
DNA probe from the hPSl gene. In this manner, a 2 kb partial 

35 trfiuiscript (representing the 3' end of the gene) and several RT- 
PCR products representing the 5' end were recovered. Sequencing 
of the consensus cDNA transcript of the murine homologue revealed 
substantial amino acid identity with hPSl. Importaintly, as 
detailed below, all of the amino acids that were mutated in the 

40 FAD pedigrees were conserved between the mi^rine homologue and the 
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normal human variant. This conservation of the PSl gene 
indicates that an orthologous gene exists in the mouse (mPSl) , 
and that it is now possible to clone other mammalian homologues 
or orthologues by screening genomic or cDNA libraries using human 
5 PSl probes. Thus, a similar approach will make it possible to 
identify and characterize the PSl gene in other species. The 
nucleic acid sequence of the mPSl clone is disclosed herein as 
SEQ ID NO: 16 and the corresponding amino acid sequence is 
disclosed as SEQ ID NO: 17, Both sequences have been deposited 
10 in the GenBank database and may be retrieved through Accession # 

42177. 

3. Isolation of the Human Presenilin-2 Gene 

A second human gene, now designated presenilin-2 (PS2) , has 
been isolated and demonstrated to share substantial nucleotide 

15 and amino acid homology with the PSl gene. The initial isolation 
of this gene is described in detail in Rogaev et al, (1995). 
Isolation of the human PS2 gene (referred to as "STM2") by nearly 
identical methods is also reported in Levy-Lahad et al. (1995). 
Briefly, the PS2 gene was identified by using the nucleotide 

20 sequence of the cDNA for PSl to search data bases using the 
BLASTN paradigm of Altschul et al. (1990). Three expressed 
sequence tagged sites (ESTs) identified by Accession #s TG3796, 
R14600, and R05907 were located which had substantial homology (p 
< 1.0 e-"°, greater than 97% identity over at least 100 contiguous 

25 base pairs) • 

Oligonucleotide primers were produced from these sequences 
and used to generate PGR products by reverse transcriptase PGR 
(RT-PGR) . These short RT-PGR products were partially sequenced 
to confirm their identity with the sequences within the data base 

3 0 and were then used as hybridization probes to screen full-length 
CDNA libraries. Several different cDNAs ranging in size from 1 
kb to 2.3 kb were recovered from a cancer cell cDNA library 
(Caco2) cuid from a human brain cDNA library (E5-1, Gl-l, ccS4, 
cc32) . The nucleotide sequence of these clones confirmed that 

35 all were derivatives of the same transcript. 

The gene encoding the transcript, the PS2 gene, mapped to : 
human chromosome 1 using hybrid mapping panels to two clusters of 
GEPH Mega YAG clones which have been placed upon a physical 
contig map (YAG clones 7S0g7, 921dl2 mapped by FISH to lq41; and 

40 YAG clone 787gi2 mapped to Ip36.1-p35) . The nucleic acid 



Cf IPC I f f'l f^TfP ^ 09m» ^ 



wo 96/34099 




PCT/CA96/00263 



- 26 - 

sequence of the hPS2 clone is disclosed herein as SEQ ID NO: 18 
and the corresponding amino acid sequence is disclosed as SEQ ID 
NO: 19. Both sequences have been deposited in the GenBank 
database and may be retrieved through Accession # L44577. The 
5 DNA sequence of the hPS2 clone also has been incorporated into a 
vector and deposited at the ATCC, Rockville, MD., under ATCC 
Accession Number 97214 on June 28, 1995, 
4. Identification of Homolooues in C. e leaans and D, 
melanoaaster 

10 A. SPE>4 of C. eleaans 

Comparison of the nucleic acid and predicted amino acid 
sequences of PSl with available databases using the BLAST 
alignment paradigms revealed modest amino acid similarity with 
the C. eleaans sperm integral membrane protein SPE-4 (P « 1.5e- 

15 25, 24-37% identity over three groups of at least fifty residues) 
and weaker similarity to portions of several other membreme 
spanning proteins including mammalian chromogranin A and the 
alpha subunit of mammalian voltage dependent calcium cheuinels 
(Altschul et al., 1990), Amino-acid sequence similarities across 

20 putative transmembrcme domains may occasionally yield alignment 
that simply arises from the limited number of hydrophobic amino 
acids, but there is also extended sequence alignment between PSl 
and SPE-4 at several hydrophilic domains. Both the putative PSl 
protein and SPE-4 are predicted to be of comparable size (467 and 

25 465 residues, respectively) and, as described more fully below, 
to contain at least seven transmembrauie domains with a large 
acidic domain preceding the final predicted transmembrane domain. 
The PSl protein does have a longer predicted hydrophilic region 
at the N terminus. 

30 BLASTP alignment analyses also detected significant homology 

between PS2 and the C. eleaans SPE-4 protein (p « 3.5e-26; 
identity = 20-63% over five domains of at least 22 residues) , and 
weak homologies to brain sodium channels (alpha III s\ibunit) and 
to the alpha sxibunit of voltage dependent calcium channels from a 

35 variety of species (p = 0.02; identities 20-28% over two or more 
domains each of at least 35 residues) (Altschul, 199Q) • These 
alignments are similar to those described above for the PSl gene. 
B. Sel-12 of C. eleaans 

The 461 residue Sel-12 protein from C. eleaans and S182 (SEQ 

40 ID NO: 2) were found to share 48% sequence identity over 460 



15 



WO 96/34099 W W PCT/CA96/00263 

- 27 - 

amino acids (Levitan and Greenwald, 1995) . The Sel-12 protein 
also is believed to have multiple transmembrane domains. The 
sel-12 gene (Accession number U35660) was identified by screening 
for suppressors of a lin-12 gain- of -function mutation, and was 
5 cloned by transformation rescue (Levitan and Greenwald, 1995) . 
C. DmPS of D. melanQcr;^B^«.y 

Redundant oligonucleotides coding for highly conserved 
regions of the presenilin/sel 12 proteins were prepared and used 
to identify relevant mRNAs from adult and embryonic 
10 PffSlangq^^^^r . These mRNAs were sequenced and shown to contain an 
open reading frame with a putative amino acid sequence highly 
homologous to that of the human presenilins. The DmPS cDNA is 
identified as SEQ ID NO: 20. 

This sequence encodes a polypeptide of 541 amino acids (SEQ 
ID NO: 21) with about 52% identity to the human presenilins. 

The structure of the D. melanocT;iBt-^^ homologue is similar to 
that of the human presenilins with at least seven putative 
transmembrane domains (Kyte-Doolittle hydrophobicity analyses 
using a window of 15 and cut-off of 1.5) . Evidence of at least 
one alternative splice form was detected in that clone pdsl3 
contained an ORF of 541 amino acids, whereas clones pds7, pdsl4 
and pdsl lacked nucleotides 1300-1341 inclusive. This 
alternative splicing would result in the alteration of Gly to Ala 
at residue 384 in the putative TM6-7 loop, and an in- frame fusion 
to the Glu residue at codon 399 of the longer ORF. The principal 
differences between the amino acid sequence of the 
ff<^j,an9qast?y and human genes were in the N-terminal acid 
hydrophilic domain and in the acidic hydrophilic portion of the 
TM6-»7 loop. The residues surroxanding the TM6-»7 loop are 
especially conserved (residues 220-313 and 451-524), suggesting 
that these are functionally important domains. Sixteen out of 
twenty residues identified to be mutated in human PSl or PS2 and 
giving rise to human FAD are conserved in the D. melanoaaHr^r- 
homologue. 

35 The DNA sequence of the DmPS gene as cloned has been 

incorporated into a Bluescript plasmid. This stable vector was 
deposited with the ATCC, Roc)cville. MD., under ATCG Accession 
Number 97428 on January 26, 1996. 

5- Characterization of the Hnr^ ^ n PresAn^^T ^ y^ftn^p 
40 A. hPSl Transcripts and G^n^ stnief,„^o . . 
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Hybridization of the PSl (S182) clone to northern blots 
identified a transcript expressed widely in many areas of brain 
and peripheral tissues as a major - 2.8 kb transcript and a minor 
transcript of - 7.5 kb (see, e.g.. Figure 2 in Sherrington et 
5 al., 1995). PSl is expressed fairly vmiformly in most regions of 
the brain and in most peripheral tissues except liver, where 
transcription is low. Although the identity of the - 7.5 kb 
transcript is unclear, two observations suggest that the - 2.8 kb 
transcript represents an active product of the gene. 

10 Hybridization of the PSl clone to northern blots containing mRNA 
from a variety of murine tissues, including brain, identifies 
only a single transcript identical in size to the - 2.8 kb human 
transcript. All of the longer cDNA clones recovered to date 
(2.6-2.8 kb), which include both 5' and 3' DTRs and which account 

15 for the - 2,8 kb band on the northern blot, have mapped 
exclusively to the same physical region of chromosome 14. 

From these experiments the - 7,5 kb transcript could 
represent either a rare alternatively spliced or polyadenylated 
isoform of the - 2.8 kb transcript, or could represent another 

20 gene with homology to PSl. A cDNA library from the Caco2 cell 

line which expresses high levels of both PSl and PS2 was screened 
for long transcripts. Two different clones were obtained, GL40 
and B53. Sequencing revealed that both clones contained a 
similar 5' UTR and an ORF which was identical to that of the 

25 shorter 2.8 kb transcripts in brain. 

Both clones contained an vinusually long 3' UTR. This long 
3' UTR represents the use of an alternate polyadenylation site 
approximately 3 kb further downstream. This long 3' UTR contains 
a number of nucleotide sequence motifs which result in 

30 palindromes or stem-loop structures. These structures are 

associated with mRNA stability and also translational efficiency. 
The utility of this observation is that it may be possible to 
create recombinant expression constructs and/or transgenes in 
which the upstream polyadenylation site is ablated, thereby 

35 forcing the use of the downstream polyadenylation site and the 
longer 3' UTR. Ija certain instances, thia may promote the 
stability of selected mRNA species, with preferential translation 
that could be utilized to. alter the balance of mutant versus 
wild- type transcripts in targeted cell lines, or even in vivo in 

40 the brain, either by germ line therapy or by the use of viral 
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vectors such as modified herpes simplex virus vectors as a form 
of gene therapy. 

The hPSl gene spans a genomic interval of at least 60 kb 
within a 200 kb PACl clone RPCI-1 54D12 from the Roswell Park PAC 
5 library and three overlapping cosmid clones 57-HlO, 1-G9, and 24- 
D5 from the Los Alamos Chromosome 14 cosmid library. Transcripts 
of the PSl gene contain RNA from 13 exons which were identified 
by reiterative hybridization of oligonucleotide and partial cDNA 
probes to subcloned restriction fragments of the PAC and cosmid 

10 clones, and by direct nucleotide sequencing of these subclones. 
The 5' UTR is contained within Exons 1-4, with Exons 1 and 2 
representing alternate 5' ends of the transcript. The ORF is 
contained in Exons 4 to 13, with alternative splicing events 
resulting in the absence of part of Exon 4 or all of Exon 9. 

15 Exon 13 also includes the 3' UTR. 

Unless stated otherwise, in the interests of clarity and 
brevity, all references to nucleotide positions in hPSl derived 
nucleotide sequences will employ the base numbering of SEQ ID NO: 
1 (L42110), an hPSl cDNA sequence starting with Exon 1. In this 

20 CDNA, Exon 1 is spliced directly to Exon 3, which is spliced to 
Exons 4-13. In SEQ ID NO: 1, Exon 1 spans nucleotide positions 1 
to 113, Exon 3 spans positions 114 to 195, Exon 4 spans positions 
196 to 335, Exon 5 spans positions 336 to 586, Exon 6 spans 
positions 587 to 728, Exon 7 spans positions 729 to 796, Exon 8 

25 spans positions 797 to 1017, Exon 9 spans positions 1018 to 1116, 
Exon 10 spans positions 1117 to 1203, Exon 11 spans positions 
1204 to 1377, Exon 12 spans positions 1378 to 1496, Exon 13 spans 
positions 1497 to 2765. Similarly, unless stated otherwise, all 
references to amino acid residue positions in hPSl derived 

30 protein sequences will employ the residue numbering of SEQ ID NO: 
2, the translation product of SEQ ID NO: l. 

Flanking genomic sequences have been obtained for Exons 1- 
12, and are presented in SEQ ID NOs: 5-14 (Accession numbers: 
L76518-L76527) . Genomic sequence 5' from Exon 13 has also been 

35 determined and is presented in SEQ ID NO: 15 (Accession number: 
_L76528) . SEQ ID NOs : 5-14 also include the conplete Exon 

sequences. SEQ ID NO: 15, however, does not include the 3' end 
of Exon 13. The genomic sequences corresponding to Exons 1 and 2 
are located approximately 240 bp apart on a 2.6 kb BamHI-Hindlll 

40 fragment, SEQ ID NO: 5. Exons 3 and 4 (which contains the ATG 
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start codon) are located on a separate 3 kb BamHI fragment. The 
complete sequence of Intron 2 between the BamHI site -850 bp 
downstream of Exon 2 and the BamHI site -600 bp upstream of Exon 
3 has not yet been identified, and was not immediately recovered 
5 by extended PCR using primers from the flanking BamHI sites, 
implying that Intron 2 may be large. 

Analysis of the nucleotide sequence surrounding Exons i and 
2 (SEQ ID NO: 5) revealed numerous CpG dinucleotides including a 
NotI restriction site in Intron i. Consensus sequences for 
10 several putative transcriptional regulatory proteins including 
multiple clusters of Activator Protein-2 {AP-2) , Signal 
Transducers and Activators of Transcription (STAT3) (Schindler 
and Darnell, 1995), Gamma Activator Sequences (GAS or STATl) , 
Multiple start site Element Downstream (MED) (Ince and Scot to, 
15 1995) , and GC elements were present in both Intron 1 and in the 
sequence 5' from Exon 1 (see SEQ ID NO: 5) . Two putative TATA 
boxes exist upstream of Exon 1, at bp 925-933 and 978-987 of SEQ 
ID NO: 5, and are followed by two putative transcription 
initiation (CAP or Chambon-Trifonov) consensus sequences at 1002- 
20 1007 bp and 1038-1043 bp 484 of SEQ ID NO: 5. In contrast, the 
sequences immediately upstream of Exon 2 lack TATA boxes or CAP 
sites, but are enriched in clusters of CpG islands. 

A schematic map of the structural organization of the hPSl 
gene is presented as Figure i. Non-coding exons are depicted by 
25 solid shaded boxes. Coding exons are depicted by open boxes or 
hatched boxes for alternatively spliced sequences. Restriction 
sites are indicated as: B - BamHI; E =. EcoRI; H « Hindlll; N = 
NotI, P - PstI; V - Pvull; X - Xbal. Discontinuities in the 
horizontal line between restriction sites represent undefined 
30 genomic sequences, cloned genomic fragments containing each exon 
are depicted by double-ended horizontal arrows. The size of the 
genomic subclones and Accession number for each genomic sequence 
are also provided. 

Predictions of DNA secondary structure based upon the 
nucleotide sequence within 290 bp upstream of Exon 1 and within 
intron i reveal several palindromes with stability greater than - 
16 kcal/mol. These secondary structure analyses also predict the 
presence of three stable stem- loop motifs (at bp 1119-1129/1214- 
1224; at bp 1387-1394/1462-1469; and at bp 1422-1429/1508-lSlS ; 
all in SEQ ID NO: 5) with a loop size sufficient to encircle a' 
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nucleosome (-76 bp) . Such stem loop structures are a common 
feature of TATA containing genes (Kollmar and Famham, 1993). 

A summary of the features in these 5' regions is presented 
in Tcdjle 1. All references to base positions are relative to SEQ 
5 ID NO: 5. 

The longest predicted open reading frame in SEQ ID NO: 1 
encodes a protein of 467 amino acids, SEQ ID NO: 2. The start 
codon for this open reading frame is the first in-phase ATG 
located downstream of a TGA stop codon. There are no classical 
10 Kozak consensus sequences around the first two in phase ATG 
codons (Sherrington et al., 1995). Like other genes lacking 
classical * strong' start codons, the putative 5' UTR of the human 
transcripts is rich in GC. 

Alternative Tran scription and Splicing of the hPSi 5' UTR 
15 Although the first three exons and part of the fourth exon 

contain non- translated sequences, analysis of multiple full 
length cDNA clones isolated from a hximan hippocampus cDNA library 
(Stratagene, La Jolla CA) and from a colon adenocarcinoma cell 
line (Caco2 from J. Rommens) revealed that in the majority of 
20 clones the initial sequences were derived from Exon l and were 
directly spliced to Exon 3 (Accession number L42110, SEQ ID NO: 
1). Less frequently (1 out of 9 clones), the initial transcribed 
sequences were derived from Exon 2 and were spliced onto Exon 3 
(Accession number L76517, SEQ ID NO: 3). Direct nucleotide 
25 sequencing of at least 40 independent RT-PCR transcripts isolated 
using a primer in Exon 1 failed to identify any clones containing 
both Exon 1 and Exon 2. Finally, inspection of the genomic 
sequence upstream of Exon 2 did not reveal a 3' splice site 
sequence. These observations argue that Exon 2 is a true initial 
30 exon rather than an alternative splice form of transcripts 

beginning in Exon 1 or an artifact of cDNA cloning. Furthermore, 
since a clone (cc44) containing Exon 2 was obtained from the same 
monoclonal Caco2 cell lines, it is likely that both Exon-1- 
containing transcripts and Exon-2- containing transcripts exist in 
35 the saitie cells. 

Tp test the predictions about tranacripfcion initiation 
based upon the nucleotide sequence of the 5' upstream region near 
Exon 1, we examined the 5' end sequence of three independent 
"full-length" cDNA clones containing Exon 1 (cc33, ccSB and cc48) 
40 and three sequences recovered by primer extension using an 
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antisense primer located in Exon 3, The furthest 5' extension 
was seen in the cDNA G40L, which mapped the most proximal 
transcription start site to position 1214 bp in the genomic 
sequence containing Exon 1 SEQ ID NO: 5 (L76518) , and which 
5 therefore corresponds to position -10 of SEQ ID NO: l. Two 
additional clones (cDNA cc48 and 5' RACE product #5) shared a 
common start site at position 1259 bp in the genomic sequence, 
SEQ ID NO: 5, which corresponds to position 34 in SEQ ID NO: i. 
The two remaining cDNAs, as well as the remaining 5' RACE clones, 

10 began at more distal positions within Exon 1. AS' RACE clone #8 
began at 1224 bp, equal to position 1 of SEQ ID NO: 1. None of 
these clones therefore extended to the predicted CAP site 
upstream of Exon 1. Due to the low prevalence of trauiscripts 
containing initial sequences from Exon 2, similar studies of 

15 their start sites were not performed • 

C. Alternative Splici ng of the hPSl ORF 

In addition to transcripts with different initial sequences, 
the analysis of multiple cDNA clones recovered from a variety of 
libraries also revealed two variations in PSl transcripts which 

20 affect the ORF. 

The first of these is the absence of 12 nucleotides from the 
3' end of Exon 4, nucleotides 324 to 335 of SEQ ID NO: 1. This 
would result from splicing of Exon 4 after nucleotide 323 instead 
of after nucleotide 335. Transcripts resulting from this 

25 alternative splicing of Exon 4 do not encode amino acid residues 
Val26-Arg27-Ser28-Gln29 of SEQ ID NO: 2. Transcripts resulting 
from these two alternative splicing events for Exon 4 were 
detected with approximately equal frequencies in all tissues 
surveyed. It is of note in the clones examined to date that the 

3 0 murine PSl transcripts do contain only the cDNA sequence for 

Ile26-Arg27-Ser2a-Gln29, and that the sequence for the Val-Arg- 
Ser-Gln motif is only partially conserved in human PS2 as Arg48- 
Ser49-Gln50 (Rogaev et al., 1995), Each of these observations 
suggests that these differences are not critical to proper PSl 

35 functioning. 

The second splicing variation affecting the ORF results in 
the absence of Exon 9, nucleotides 1018 to 1116 in SEQ ID NO: 1. 
Analysis of RT-PCR products derived from mRNA of a variety of 
tissues showed that brain (including neocortical areas typically 

40 affected by AD) and several other tissues (muscle, heart, Ixmg, 
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colon) predominemtly expressed a single transcript bearing Exon 
9. Leukocytes {but not lymphoblasts) on the other hand, also 
expressed a shorter form lacking Exon 9. Alternative splicing of 
Exon 9 is predicted to change an aspartate residue at position 
5 257 in SEQ ID NO: 2 to alanine, eliminate the next 3 3 residues, 
and result in an in- frame fusion to the rest of the protein 
beginning at the threonine at position 291 encoded in Exon 10. 
D. frpS2 Tr^nscrAptp 

The genomic DNA including the human PS 2 gene has not yet 

10 been fully characterized. Nonetheless, many similarities between 
the PSl and PS2 genes are apparent. The intron/exon boundaries 
of both genes, however, appear to be very similar or identical 
except in the region of the TM6-»7 loop. 

Hybridization of the PS2 cDNA clones to Northern Blots 

15 detected a -2.3 kb mRNA band in many tissues, including regions 
of the brain, as well as a -2.6kb mRNA band in muscle, cardiac 
muscle and pancreas. PS2 is expressed at low levels in most 
regions of the brain except the corpus callosum, where 
transcription is high. In skeletal muscle, cardiac muscle and 

20 pancreas, the PS2 gene is expressed at relatively higher levels 
than in brain and as two different transcripts of -2.3 kb and 
-2.6 kb. Both of the transcripts have sizes clearly 
distinguishable from that of the 2.7 kb PSl transcript, and did 
not cross -hybridize with PSl probes at high stringency. The cDNA 

25 sequence of one hPS2 allele is identified as SEQ ID NO: 18 
(Accession No. L44577) . 

The longest ORF within this PS2 cDNA consensus nucleotide 
sequence predicts a polypeptide containing 448 amino acids (SEQ 
ID NO: 19) niambering from the first in-phase ATG codon, at 

30 positions 366-368 in SEQ ID NO: 18, which was surroxinded by a 
Kozak consensus sequence. The stop codon is at positions 1710- 
1712, 

As for PSl, analysis of PS2 RT-PCR products from several 
tissues, including brain and muscle, RNA revealed two alternative 

35 splice variants in which a relatively large segment may be 

spliced out. Thus, at a relatively low frequency, tr2Uiscripts 
are produced in which nucleotides 1152-1250 of the PS2 
transcript, SEQ ID NO: 18, (encoding residues 263-295, SEQ ID NO: 
19) are alternatively spliced. As discussed below, this splicing 

40 ^yent corresponds closely to the alternative splicing of Exon 9 
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of PSl (Rogaev et al., 1995). 

An additional splice variant of the PS2 cDNA sequence 
lacking the GAA triplet at nucleotide positions 1338-1340 in SEQ 
ID NO: 18 has also been found in all tissues examined. This 
5 alternative splice results in the omission of a Glu residue at 
amino acid position 325. 

6- Structure of the Prgaenilin Prn^fi,r« 
A. The Presenilin Protein Fan^^ly 

The presenilins are now disclosed to be a novel family of 
10 highly conserved integral membrane proteins with a common 
structural motif, common alternative splicing patterns, and 
common mutational regions hot spots which correlate with putative 
structural domains which are present in many invertebrate and 
vertebrate animal cells. Analysis of the predicted amino acid 
15 sequences of the human presenilin genes using the Hopp and Woods 
algorithm suggests that the proteins are multispanning integral 
membrane proteins such as receptors, channel proteins, or 
Structural membrane proteins. A Kyte-Doolittle hydropathy plot 
of the putative hPSl protein is depicted in Figure 2. The 

20 hydropathy plot and structural analysis suggest that these 

proteins possess approximately seven hydrophobic transmembrane 
domains (designated TMl through TM7) separated by hydrophilic 
6loops.6 Other models can be predicted to have as few as 5 and 
as many as 10 transmembrane domains depending upon the parameters 

25 used in the prediction algorithm. The presence of seven membrane 
spanning domains, however, is characteristic of several classes 
of G-coupled receptor proteins, but is also observed with other 
proteins (e.g., channel proteins). The absence of a recognizable 
signal peptide and the paucity of glycosylation sites are 

30 noteworthy. 

The amino acid sequences of the hPSl and mPSl proteins are 
compared in Figure 3, and the sequences of the hPSl and hPS2 
proteins are compared in Figure 4. In each figure, identical 
amino acid residues are indicated by vertical bars. The seven 
35 jputative transmembrane domains are indicated by horizontal lines 
above or below the sequenqes. 

The major differences between members of this family reside 
in the amino acid sequences of the hydrophilic, acidic loop 
domains at the N- terminus and between the putative TM6 and TM7 
domains of the presenilin proteins (the TM6-»7 loop) . Most of the 
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residues encoded by hPSl Exon 9, which is alternatively spliced 
in some non-neural tissues, form part of the putative TM6-.7 loop. 
In addition, the corresponding alternative splice variant 
identified in hPS2 appears to encode part of the TM6-«7 loop. The 
variable splicing of this hydrophilic loop, and the fact that the 
amino acid sequence of the loop differs between members of the 
gene family, suggest that this loop is an important functional 
domain of the protein and may confer some specificity to the 
physiologic and pathogenic interactions of the individual 
presenilin proteins. Because the N-terminal hydrophilic domain 
shares the same acidic charge as the TM6-7 hydrophilic acid loop, 
and in a seven transmembrane domain model is likely to have the 
same orientation with respect to the membrane, and is also 
variable amongst the presenilins. it is very likely that these 
15 two domains share functionality either in a coordinated or 
independent fashion (e.g. the same or different ligands or 
functional properties). Thus, it is likely that the N-terminus 
is also an important functional domain of the protein and may 
confer some specificity to the physiologic and pathogenic 
interactions of the individual presenilin proteins. 

As detailed below, the pathogenic mutations in PSl and PS2 
cluster around the TMIh.2 loop and TM6-7 loop domains, further 
suggesting that these domains are the functional domains of these 
proteins. Figures 5 and 6 depict schematic drawings of predicted 
structures of the PSl and PS2 proteins, respectively, with the 
known mutational sites indicated on the figures. As shown in the 
figures, the TMIh.2 linking sequence is predicted to reside on the 
opposite side of the membrane to that of the N-terminus and TM6^7 
loop, and may be important in transmembrane communication. This 
is supported by the PSl YllSH mutation which was observed in a 
pedigree with early onset familial AD (30-40 years) and by 
additional mutations in the TMl/2 helices which might be expected 
to destabilize the loop. The TM1-.2 loop is relatively short 
(PSl: residues 101-132; PS2: residues 107-134) taaking these 
sequence more amenable to conventional peptide synthesis. Seven 
PSl mutations cluster in the region between .a*out codon 82. and 
codon 146, which comprises the putative first transmembrane 
domain (TMl) . the TMl-2 loop, and the TM2 domain in PSl. 
Similarly, a mutation at codon 141 of PS2 is also located in the 
40 TM2 domain. These mutations probably destabilize the TMl^2 loop 
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domain and its anchor points in TMl and TM2. Twelve PSl 
mutations result in the alteration of amino acids between about 
codons 246 and 410, which are involved in the TM6, TM6-»7 loop, 
and TM7 domains. These mutations may modify the structure or 
5 stability of the TM6-»7 loop (either directly or by modifying the 
conformation of TM6 or TM7) . 

Further evidence for an important functional role residing 
in the TM6-»7 loop is the sequence divergence in the central part 
of the TM6-.7 loop (approximately amino acids 300 to 371) among 
10 different members of the presenilin protein family. Similarly, 
because the N- terminus sequences of members of the presenilin 
protein family are also divergent, it is likely that the slightly 
divergent sequences play a role in conferring specificity to the 
function of each of the different presenilin proteins while the 
15 conserved sequences confer the common biologic activities. These 
regions may represent ligand binding sites. If this is so, 
mutations in the TM6-*7 region are likely to modify ligand binding 
activity. The TMl-»2 loop, which is conserved amongst different 
members of the presenilin protein family, probably represents an 
20 effector domain on the opposing membrane face. With the 

exception of the Exon 10 splicing mutation, most of the other 
(missense) mutations align on the same surfaces of putative 
transmembrane helices, which suggests that they may affect ligand 
binding or channel functions. Thus, these domains (e.g., TM6-»7 
25 and TMl-*2 loops) can be used as sites to develop specific binding 
agents to inhibit the effects of the mutations and/or restore the 
normal fxinction of the presenilin protein in subjects with 
Alzheimer's Disease. 

The similarity between the putative products of the 
30 alsa^is SPE-4 and the PSl genes implies that they may have 

similar activities. The SPE-4 protein appears to be involved in 
the formation and stabilization of the fibrous body-membrane 
organelle (FBMO) complex during spermatogenesis. The PBMO is a 
specialized Golgi-derived organelle, consisting of a membrane 
35 bound vesicle attached to and partly surrounding a complex of 

parallel protein fibers and may be involved in the transport and 
storage of soluble and membrane -bound polypeptides. Mutations in 
SPE-4 disrupt the FBMO complexes and arrest spermatogenesis. 
Therefore the physiologic function of SPE-4 may be either to 
40 stabilize interactions between integral membrane budding tod 



WO96AM099 



PCT/CA96AN>263 



37 



15 



20 



fusion events, or to stabilize interactions between the membrane 
and fibrillary proteins during the intracellular transport of the 
FBMO complex during spermatogenesis. Comparable fxanctions could 
be envisaged for the presenilins. For example, PSl could be 
5 involved either in the docking of other membrane -bound proteins 
such as /3APP, or the axonal transport and fusion budding of 
membrane -bound vesicles during protein transport, such as in the 
Golgi apparatus or endosome-lysosome system. If these hypotheses 
are correct, then mutations might be expected to result in 
10 aberrant transport and processing of /ffAPP and/or abnormal 

interactions with cytoskeletal proteins such as the microtubule- 
associated protein Tau. Abnormalities in the intracellular and 
in the extracellular disposition of both /SAPP and Tau are in fact 
an integral part of the neuropathologic features of Alzheimer's 
Disease. Although the location of the PSl and PS2 mutations in 
highly conserved residues within conserved domains of the 
putative proteins suggests that they are pathogenic, at least 
three of these mutations are themselves conservative, which is 
commensurate with the onset of disease in adult life. Because 
none of the mutations observed so far are deletions or nonsense 
mutations that would be expected to cause a complete loss of 
expression or function, we cannot predict whether these mutations 
will have a dominant gain- of -function effect, thus promoting 
aberrant processing of or a dominant lose-of- function effect 

causing arrest of normal /5APP processing. The Exon 10 splicing 
mutation causes an in-frame fusion of Exon 9 to Exon lO, and may 
have a structural effect on the PSl protein which could alter 
intracellular targeting or ligand binding, or miy otherwise 
affect PSl function. 

An alternative possibility is that the PSl gene product may 
represent a receptor or channel protein. Mutations of such 
proteins have been causally related to several other dominant 
neurological disorders in both vertebrate (e.g., malignant 
hyperthermia, hyperkalemic periodic paralysis in humans) an4 in 
35 invertebrate organisms (deg-l (d) mutants in C. eleaanH V 

Although the pathology of tfeese other disorders does not resemble 
that of Alzheimer's Disease, there is evidence for functional 
abnormalities in ion channels in Alzheimer's Disease. For 
example, anomalies have been reported in the tetra-ethylammonium- 
40 sensitive ll3pS potassium channel and in calcium homeostasis. 
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Perturbations in transmembrane calcium fluxes might be especially 
relevant in view of the weak homology between PSl and the a- ID 
subunit of voltage-dependent calcium channels and the observation 
that increases in intracellular calcium in cultured cells can 
replicate some of the biochemical features of Alzheimer's 
Disease, such as alteration in the phosphorylation of Tau- 
microtubule-associated protein and increased production of A/S 
peptides. 
B. hPSl Structure 

As shown in SEQ ID NO: 2, the largest known form of the 
human PSl protein comprises 467 amino acids and has a predicted 
molecular mass of approximately 51.37 kDa. A variant with the 
above -described alternative splicing of Exon 4 (in which the 
residues corresponding to positions 26-29 of SEQ id NO: 2 are 
deleted) would include 4 fewer amino acids and have a mass of 
approximately 50.93 kDa. Similarly, a variant with the above- 
described alternative splicing of Exon 9 {in which the residues 
corresponding to positions 258-290 of SEQ ID NO: 2 are deleted) 
would include 33 fewer amino acids and would have a molecular 
mass of approximately 47.74 kDa. 

The positions of the putative domains are presented in Table 2. 
Note again that the numbering of the residue positions is with 
respect to SEQ ID NO: 2 and is approximate (i.e. ± 2 residues). 

A schematic drawing of the putative PSl structure is shown 
in Fig. 5. The N-terminus is a highly hydrophilic. negatively 
charged domain with several potential phosphorylation domains, 
followed sequentially by a hydrophobic membrane spanning domain 
of approximately 19 residues (TMl) , a charged hydrophilic loop of 
approximately 32 residues (TMl-2) . five additional hydrophobic 
membrane spanning domains (TM2 through TM6) interspersed with 
short (1-15 residue) hydrophilic domains (TM2-»3 through TM5-6), 
an additional larger, acidic hydrophilic charged loop (TM6-7) and 
at least one (TM7) , and possibly two, other hydrophobic 
potentially membrane-spanning domains, culminating in a polar 
domain at the C- terminus. 

The protein also contains, a number of potential . 
phosphorylation sites, one of which is a MAP kinase consensus 
site which is also involved in the hyperphosphorylation of Tau 
during the conversion of normal Tau to neurofibrillary tangles. 
This consensus sequence may. provide a putative element linking 
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this protein's activity to other biochemical aspects of 
Alzheimer's Disease, and would represent a likely therapeutic 
target. Review of the protein structure reveals two sequences 
YTPF (residues llS-lie, SEQ id NO: 2) and STPE (residues 353-356, 
SEQ ID NO: 2) which represent the S/T-P motif which is the MAP 
kinase consensus sequence. Several other phosphorylation sites 
exist with consensus sequences for Protein Kinase C (pkc) 
activity. Because PKC activity is associated with differences in 
the metabolism of APP which are relevant to Alzheimer's Disease, 
these sites on the PSl protein and its homologues are also sites 
for targeting therapeutics. Preliminary evidence indicates that, 
at least in transfected cells, the PSl protein is phosphorylated 
only to a minor degree while the PS2 protein is significantly 
phosphorylated. For PS2 at least, it appears that this 
phosphorylation occurs on serine residues in the N-terminal 
domain by a mechanism which does not involve PKC (Capell et al 
1996). 

Note that the alternative splicing at the end of Exon 4 
removes four amino-acids from the hydrophilic N-terminal domain. 
20 and would be expected to remove a phosphorylation consensus 
sequence. In addition, the alternative splicing of Exon 9 
results in a truncated isoform of the PSl protein wherein the C- 
terminal five hydrophobic residues of TM6 and part of the 
hydrophilic negatively-charged TM6^7 loop immediately C- terminal 
25 to TM6 is absent. This alternatively spliced isoform is 

characterized by preservation of the sequence from the N-terminus 
up to and including the tyrosine at position 256 of SEQ ID NO: 2, 
changing of the aspartate at position 257 to alanine, and 
splicing to the C- terminal part of the protein from and including 
30 tyrosine 291. Such splicing differences are often associated 
with important functional domains of the proteins. This argues 
that this hydrophilic loop (and consequently the N-terminal 
hydrophilic loop with similar amino acid charge) is/are active 
functional domains of the PSl product and thus sites for 
35 therapeutic targeting. ' 
. c. Human PS2 structur-^ 

The human PSl and PS2 proteins show 63% over- all amino acid 
Identity and several domains display virtually complete identity 
As would be expected, therefore, hydrophobicity analyses suggest 
40 that both proteins also share a similar structural organization 
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Thus, both proteins are predicted to possess seven hydrophobic 
putative transmembrame domains, euid both proteins bear large 
acidic hydrophilic domains at the N-terminus and between TM6 and 
TM7. A further similarity was apparent from the eOaove- described 
5 analysis of RT-PCR products from brain and muscle RNA, which 
revealed that nucleotides 1153-1250 of the PS2 transcript are 
alternatively spliced. These nucleotides encode amino acids 263- 
296, which are located within the TM6-»7 loop domain of the 
putative PS2 protein and which share 94% sequence identity with 

10 the alternatively spliced amino acids 257-290 in PSl. 

The positions of the putative functional domains of the hPS2 
protein are described in Table 3. Note that residue positions 
refer to the residue positions of SEQ ID NO: 19, and that the 
positions are approximate (i.e., ± 2 residues). 

15 A schematic drawing of the putative PS2 structure is shown 

in Fig. 6. The similarity between hPSl and hPS2 is greatest in 
several domains of the protein corresponding to the intervals 
between TMl and TM6, and from TM7 to the C- terminus of the PSl 
protein. The major differences between PSl and PS2 are in the 

20 size and amino acid sequences of the negatively- charged 

hydrophilic TM6-»7 loops, and in the sequences of the N-terrainal 
hydrophilic domains. 

The most noticeable differences between the two predicted 
amino acid sequences occur in the amino acid sequence in the 

25 central portion of the TM6-*7 hydrophilic loop (residues 304-374 
of hPSl; 310-355 of hPS2) , and in the N-terminal hydrophilic 
domain. By analogy, this domain is also less highly conserved 
between the murine and humem PSl genes (identity = 47/60 
residues) , and shows no similarity to the equivalent region of 

30 SPE-4. 

7. Presenilin Mutants 
A. PSl Mutants 

Several mutations in the PSl gene have been identified which 
cause a severe type of familial Alzheimer's Disease . One or a 

35 combination of these mutations may be responsible for this form 
of Alzheimer;? Disease as well as, several other neurology 
disorders. The mutations may be any form of nucleotide sequence 
sxibstitution, insertion or deletion that leads to a change in 
predicted amino acid sequence or that leads to aberrant 

40 transcript processing, level or stability. Specific disease 
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causing mutations in the fom of nucleotide and/or amino acid 
deletions or substitutions are described below but it is 
anticipated that additional mutations will be fo\md in other 
families. Indeed, after the initial discovery of five different 
5 missense mutations amongst eight different pedigrees (Sherrington 
et al. 1995), it was expected from experience with other 
inherited disease (e.g.. Amyotrophic lateral sclerosis associated 
with mutations in the Ca'* superoxide dismutase gene) that 
additional mutations would be identified. This expectation has 
10 been fulfilled by our subsequent discovery of additional 

mutations in the presenilins (Rogaev et al., 1995) and by similar 
observations by others (e.g., Cruts et al., 1995; Campion et al., 
1995) . Thus, as used herein with respect to PSl genes and 
proteins, the term 6mutant6 is not restricted to these particular 
15 mutations but. rather, is to be construed as defined above. 

Direct sequencing of overlapping RT-PCR products spaiming 
the 2.8 kb S182 transcript isolated from affected members of the 
six large pedigrees linked to chromosome 14 led initially to the 
discovery of five missense mutations in each of the six 
20 pedigrees. Each of these mutations co-segregated with the 

disease in the respective pedigrees, and were absent from upwards 
of 142 unrelated neurologically normal subjects drawn from the 
same ethnic origins as the FAD pedigrees (284 unrelated 
chromosomes) . The location of the gene within the physical 
interval segregating with AD3 trait, the presence of eight 
different missense mutations which co-segregate with the disease 
trait in six pedigrees definitively linked to chromosome 14, and 
the absence of these mutations in 284 independent normal 
chromosomes cumulatively confirmed that the PSl gene is the AD3 
locus. Further biological support for this hypothesis arises 
from the facts that the residues mutated in FAD kindreds are 
conserved in evolution (e.g., hPSl v. mPSl) , that the mutations 
are located in domains of the protein which are also highly 
conserved in other vertebrate and invertebrate homologues, and 
thait the PSl gene product is expressed at high levels in most 
regions of the brain, including those most severely . affected by 
AD. 

Since the original discovery of the PSl gene, many 
additional mutations associated with the development of AD have 
40 been catalogued. Table 4 characterizes a number of these. Each 
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of the observed nucleotide deletions or substitutions occurred 
within the putative ORF of the PSl transcript, and would be 
predicted to change the encoded amino acid at the positions 
shown. The mutations are listed with reference to their 
5 nucleotide locations in SEQ ID NO: 1 and with reference to their 
amino acid positions in SEQ ID NO: 2. An entry of ^NA" indicates 
that the data was not available. 

As discussed in the next section, a number of PS2 mutations 
have also been found. A comparison of the hPSl and hPS2 

10 sequences is shown in Figure 4 and reveals that these pathogenic 
mutations are in regions of the PS2 protein which are conserved 
in the PSl protein. Therefore, corresponding mutations in the 
PSl protein may also be expected to be pathogenic and are 
included in the PSl mutants provided and enabled herein. 

15 Furthermore, any pathogenic mutation identified in any conserved 
region of a presenilin gene may be presumed to represent a mutant 
of the other presenilins which share that conserved region. 

Interestingly, mutations A260V, C263R, P264L, P267S, E280A, 
E280G, A285V, L286V, A291-319, G384A, L392V, and C410Y all occur 

20 in or near the acidic hydrophilic loop between the putative 
transmembrane domains TM6 and TM7, Eight of these mutations 
(A260V, C263R, P264L, P267S, E280A, E280G, A285V, L286V) are also 
located in the alternative splice domain (residues 257-290 of SEQ 
ID NO: 2) . 

25 All of these mutations can be assayed by a variety of 

strategies (direct nucleotide sequencing, allele specific 
oligonucleotides, ligation polymerase chain reaction, SSCP, 
RFLPs, new "DNA chip" technologies, etc.) using RT-PCR products 
representing the mature mRNA/cDNA sequence or genomic DNA. 

30 Finally, it should be noted that several polymorphisms with 

no apparent deleterious effect have also been discovered. One of 
these, a T-*G change of nucleotide 863 of SEQ ID NO: 1, causes a 
F205L polymorphism in TM4. Others (C-*A at bp 1700; Gr-#A at bp 
2603; deletion of bp 2620) are in the 30 UTR. 

35 B. PS2 Mutants 

Ti^e strong similarity between PSl and the PS? geije product 
raised the possibility that the PS2 gene might be the site of 
disease-causing mutations in some of a small nxamber of early 
onset AD pedigrees in which genetic linkage studies have excluded 

40 chromosomes 14, 19 and 21 • RT-PCR was used to isolate cDNAs 
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corresponding to the PS2 transcript from lyinphoblasts, 
fibroblasts or post-mortem brain tissue of affected members of 
eight pedigrees with early onset FAD in which mutations in the 
0APP and PSl genes had previously been excluded by direct 
5 sequencing studies. 

Examination of these RT-PCR products detected a heterozygous 
substitution at nucleotide 1080 in all four affected members 
of an extended pedigree of Italian origin (FlolO) with early 
onset, pathologically confirmed PAD (onset 50-70 yrs) . This 
10 mutation would be predicted to cause a Met^Val missense mutation 
at codon 239 in TM5. 

A second mutation (A-T at nucleotide 787) causing a Asn-.Ile 
substitution at codon 141 in TM2 was found in affected members of 
a group of related pedigrees of Volga German ancestry 
(represented by cell lines AG09369. AG09907, AG09952, and 
AG09905, coriell Institute, Camden N J) . Significantly, one 
subject (AG09907) was homozygous for this mutation, an 
observation compatible with the inbred nature of these pedigrees. 
Significantly, this subject did not have a significantly 
different clinical picture from those subjects heterozygous for 
the N141I mutation. Neither of the PS2 gene mutations were found 
in 284 normal Caucasian controls nor were they present in 
affected members of pedigrees with the AD3 type of AD. 

Both of these PS2 mutations would be predicted to cause 
substitution of residues which are highly conserved within the 
PSl /PS 2 gene family. 

An additional PS2 mutation is caused by a T-C substitution 
at base pair 1624 causing an He to Thr substitution at codon 420 
of the C-terminus. This mutation was found in an additional case 
of early onset (45 yrs) familial AD. 

These hPS2 mutations are listed in Table 5 with reference to 
their nucleotide locations in SEQ ID NO: 18 and with reference to 
their amino acid positions in SEQ ID NO: 19. An entry of "NA" in 
the table indicates that the data was not available. 

As discussed in the previous section, a number of Psl 
mutations have also been found. A comparison of the hPSl and 
hPS2 sequences is shown in Figure 4 and reveals that these 
pathogenic mutations are in regions of the PSl protein which are 
largely conserved in the PS2 protein. Therefore, corresponding 
mutations in the PS2 protein may also be expected to be 
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pathogenic and are included in the PS2 mutants provided and 
enabled herein. Furthermore, any pathogenic mutation identified 
in any conserved region of a presenilin gene may be presumed to 
represent a mutant of the other presenilins which share that 
5 conserved region. 

The finding of a gene whose product is predicted to share 
substantial amino acid and structural similarities with the PSl 
gene product suggests that these proteins may be functionally 
related as independent proteins with overlapping fxmctions but 
10 perhaps with slightly different specific activities, as 

physically associated subunits of a multimeric polypeptide or as 
independent proteins performing consecutive functions in the same 
pathway. 

The observation of three different missense mutations in 

15 conserved domains of the PS2 protein in subjects with a familial 
form of AD argues that these mutations are, like those in the PSl 
gene, causal to AD. This conclusion is significant because, 
while the disease phenotype associated with mutations in the PSl 
gene (onset 30-50 yrs, duration 10 yrs) is subtly different from 

20 that associated with mutations in the PS2 gene (onset 40-70 yrs; 
duration up to 20 yrs), the general similarities clearly argue 
that the biochemical pathway subsumed by members of this gene 
family is central to the genesis of at least early onset AD. The 
subtle differences in disease phenotype may reflect a lower level 

25 of expression of the PS2 transcript in the CNS, or may reflect a 
different role for the PS2 gene product. 

By analogy to the effects of PSl mutations, PS2 when mutated 
may cause aberrant processing of APP (Amyloid Precursor Protein) 
into A/3 peptide, hyperphosphorylation of Tau microtubule 

30 associated protein and abnormalities of intracellular calcium 
homeostasis. Interference with these anomalous interactions 
provides for therapeutic intervention in AD. 

Finally, at least one nucleotide polymorphism has been found 
in one normal individual whose PS2 cDNA had a T-^ change at bp 

35 626 of SEQ ID NO: 18, without any change in the encoded amino 
.-. acid sequence... .,. 

III. Preferred EmbodimentH 

Based, in part, upon the discoveries disclosed and described 
herein, the following preferred embodiments of the present 

40 invention are provided. 
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1. Isolated NuelP-if^ f^T^t*? 

In one series of embodiments, the present invention provides 
isolated nucleic acids corresponding to, or relating to, the 
presenilin nucleic acid sequences disclosed herein. As described 
more fully below, these sequences include normal PSl and PS2 
sequences from humans and other mammalian species, mutant PSl and 
PS2 sequences from humans and other mammalian species, homologous 
sequences from non-mammalian species such as Drosophiia and 
Sleaans, subsets of these sequences useful as probes and PCR 
primers, subsets of these sequences encoding fragments of the 
presenilin proteins or corresponding to particular structural 
domains or polymorphic regions, complementary or antisense 
sequences corresponding to fragments of the presenilin genes, 
sequences in which the presenilin coding regions have been 
operably joined to exogenous regulatory regions, and sequences 
encoding fusion proteins of the portions of the presenilin 
proteins fused to other proteins useful as markers of expression, 
as "tags" for purification, or in screens and assays for proteins 
interacting with the presenilins. 

Thus, in a first series of embodiments, isolated nucleic 
acid sequences are provided which encode normal or mutant 
versions of the PSl and PS2 proteins. Examples of such nucleic 
acid sequences are disclosed herein. These nucleic acids may be 
genomic sequences (e.g., SEQ ID NOs: 5-15) or may be cDNA 
25 sequences (e.g., SEQ ID NOs: 1, 3, 16, and 18). In addition, the 
nucleic acids may be recombinant genes or "minigenes" in which 
all or some of the introns Various combinations of the introns 
and exons and local siM. acting regulatory elements may be 
engineered in propagation or expression constructs or vectors. 
Thus, for example, the invention provides nucleic acid sequences 
in which the alternative splicing variations described herein are 
incorporated at the DNA level, thus enabling cells including 
these sequences to express only one of the alternative splice 
variants at each splice position. As an example, a recombinant 
gene may be produced in which the 3' end of Exon 1 of the PSl ■ 
gene (bp 1337 of SEQ id NO: 5) has. been joined directly to the 5' 
end of Exon 3 (bp 588 of SEQ ID NO: 6) so that only transcripts 
corresponding to the predominant transcript are produced. 
Obviously, one also may create a recombinant gene 6forcing6 the 
40 alternative splice of Exon 2 and Exon 3. Similarly, a 



30 



35 



CI IBC 1 I I • l^f f>t f ^1 



wo 96/34099 



PCT/CA96ffi0263 



- 46 - 



10 



recombinant gene may be produced in which one of the Exon 4 or 
Exon 9 splice variants of PSl (or the corresponding TM6-»7 splice 
variant of PS2) is incorporated into DNA such that cells 
including this recombinant gene can express only one of these 
variants. For purposes of reducing the size of a recombinant 
presenilin gene, a cDNA gene may be employed or various 
combinations of the introns and untranslated exons may be removed 
from a DNA construct. Finally, recombinant genes may be produced 
in which the 5' UTR is altered such that transcription proceeds 
necessarily from one or the other of the two transcription 
initiation sites. Such constructs may be particularly useful, as 
described below, in identifying compounds which can induce or 
repress the expression of the presenilins. Many variations on 
these embodiments are now enabled by the detailed description of 
15 the presenilin genes provided herein. 

In addition to the disclosed presenilin sequences, one of 
ordinary skill in the art is now enabled to identify and isolate 
nucleic acids representing presenilin genes or cDNAs which are 
allelic to the disclosed sequences or which are heterospecif ic 
20 homologues. Thus, the present invention provides isolated 

nucleic acids corresponding to these alleles and homologues, as 
well as the various above -described recombinant constructs 
derived from these sequences, by means which are well known in 
the art. Briefly, one of ordinary skill in the art may now 
25 screen preparations of genomic or cDNA, including samples 

prepared from individual organisms (e.g., human AD patients or 
their family members) as well as bacterial, viral, yeast or other 
libraries of genomic or cDNA, using probes or PGR primers to 
identify allelic or homologous sequences. Because it is 
30 desirable to identify additional presenilin gene mutations which 
may contribute to the development of AD or other disorders, 
because it is desirable to identify additional presenilin 
polymorphisms which are not pathogenic, and because it is also 
desired to create a variety of animal models which may be used to 
35 study AD and screen for potential therapeutics, it is 
.... particularly contemplated that additional presenilin sequences 
will be isolated from other preparations or libraries of human 
nucleic acids and from preparations or libraries from animals 
includinig rats, mice, hamsters, guinea pigs, rabbits, dogs, cats, 
40 goats, sheep, pigs, and non- human primates. Furthermore, 
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presenilin homologues from yeast or invertebrate species, 
including C, ^;i,eqan? and other nematodes, as well as Drosophila 
and other insects, may have particular utility for drug 
screening. For example, invertebrates bearing mutant presenilin 
homologues (or mammalian presenilin transgenes) which cause a 
rapidly occurring and easily scored phenotype (e.g., abnormal 
vulva or eye development after several days) can be used as 
screens for drugs which block the effect of the mutant gene. 
Such invertebrates may prove far more rapid and efficient for 
mass screenings than larger vertebrate animals. Once lead 
compounds are found through such screens, they may be tested in 
higher animals. 

Standard hybridization screening or PGR techniques may be 
employed (as used, for example, in the identification of the mPSl 
15 gene) to identify and/or isolate such allelic and homologous 

sequences using relatively short presenilin gene sequences. The 
sequences may include 8 or fewer nucleotides depending upon the 
nature of the target sequences, the method employed, and the 
specificity required. Future technological developments may 
20 allow the advantageous use of even shorter sequences. With 

current technology, sequences of 9-50 nucleotides, and preferably 
about 18-24 are preferred. These sequences may be chosen from 
those disclosed herein, or may be derived from other allelic or 
heterospecific homologues enabled herein. When probing mRNA or 
25 screening cDNA libraries, probes and primers from coding 

sequences (rather than introns) are preferably employed, and 
sequences which are omitted in alternative splice variants 
typically are avoided unless it is specifically desired to 
identify those variants. Allelic variants of the presenilin 
30 genes may be expected to hybridize to the disclosed sequences 
under stringent hybridization conditions, as defined herein, 
whereas lower stringency may be employed to identify 
heterospecific homologues . 

In another series of embodiments, the present invention 
35 provides for isolated nucleic acids which include subsets of the 
presenilin sequences or their, cpinpleirients. As noted above, such 
sequences will have utility as probes and PGR primers in the 
identification and isolation of allelic and homologous variants 
of the presenilin genes. Subsequences corresponding to the 
polymorphic regions of the preaenilins, as described above* will 
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also have particular utility in screening and/or genotyping 
individuals for diagnostic purposes, as described below, in 
addition, suid also as described below, such subsets will have 
utility for encoding (1) fragments of the presenilin proteins for 
5 inclusion in fusion proteins, (2) fragments which comprise 

functional domains of the presenilin proteins for use in binding 
studies, (3) fragments of the presenilin proteins which may be 
used as immunogens to raise antibodies against the presenilin 
proteins, and (4) fragments of the presenilins which may act as 

10 competitive inhibitors or as mimetics of the presenilins to 

inhibit or mimic their physiological functions. Finally, such 
subsets may encode or represent complementary or antisense 
sequences which ceui hybridize to the presenilin genes or 
presenilin mRNA transcripts under physiological conditions to 

15 inhibit the transcription or translation of those sequences. 

Therefore, depending upon the intended use, the present invention 
provides nucleic acid subsequences of the presenilin genes which 
may have lengths varying from 8-10 nucleotides (e.g., for use as 
PGR primers) to nearly the full size of the presenilin genomic or 

20 cDNAs. Thus, the present invention provides isolated nucleic 
acids comprising sequences corresponding to at least 8-10, 
preferably 15, and more preferably at least 20 consecutive 
nucleotides of the presenilin genes, as disclosed or otherwise 
enabled herein, or to their complements. As noted above, 

25 however, shorter sequences may be useful with different 
technologies . 

In another series of embodiments, the present invention 
provides nucleic acids in which the presenilin coding sequences, 
with or without introns or recombinantly engineered as described 

30 above, are operably joined to endogenous or exogenous 5' and/or 
3' regulatory regions. The endogenous regulatory regions of the 
hPSi gene are described and disclosed in detail herein. Using 
the present disclosure and standard genetic techniques (e.g., PGR 
extensions, targeting gene walking) , one of ordinary skill in the 

3 5 art is also how enabled to clone the corresponding hPS2 5' and/or 
3' endogenous regulatory regions. Similarly, allelic .variants of 
the hPSl and hPS2 endogenous regulatory regions, as wells as 
endogenous regulatory regions from other mammalian homologues. 
are similarly enabled without undue experimentation. 

40 Alternatively, exogenous regulatory regions (i.e., regulatory 
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regions from a different conspecific gene or a heterospecif ic 
regulatory region) may be operably joined to the presenilin 
coding sequences in order to drive expression, -J^propriate 5' 
regulatory regions will include promoter elements and may also 
5 include additional elements such as operator or enhancer 

sequences, ribosome binding sequences, RNA capping sequences, and 
the like. The regulatory region may be selected from sequences 
that control the expression of genes of prokaryotic or eukaryotic 
cells, their viruses, and combinations thereof. Such regulatory 
10 regions include, but are not limited to, the lac system, the trp 
system, the tac system, and the trc system; major operator and 
promoter regions of phage X; the control region of the fd coat 
protein; early and late promoters of SV40; promoters derived from 
polyoma, adenovirus, retrovirus, baculovirus, and simian virus; 
15 3-phosphoglycerate kinase promoter; yeast acid phosphatase 

promoters; yeast alpha-mating factors; promoter elements of other 
eukaryotic genes expressed in neurons or other cell types; and 
combinations thereof. In particular, regulatory elements may be 
chosen which are inducible or repressible (e.g., the 0- 
20 galactosidase promoter) to allow for controlled and/or 
manipulable expression of the presenilin genes in cells 
transformed with these nucleic acids. Alternatively, the 
presenilin coding regions may be operably joined with regulatory 
elements which provide for tissue specific expression in 
25 multicellular organisms. Such constructs are particularly useful 
for the production of transgenic organisms to cause expression of 
the presenilin genes only in appropriate tissues. The choice of 
appropriate regulatory regions is within the ability and 
discretion of one of ordinary skill in the art and the 
3 0 recombinant use of many such regulatory regions is now 
established in the art. 

In another series of embodiments, the present invention 
provides for isolated nucleic acids encoding all or a portion of 
the presenilin proteins in the form of a fusion protein. in 
35 these embodiments, a nucleic acid regulatory region (endogenous 
or exogenous) is operably joined to a first coding region which 
is covalently joined in- frame to a second coding region. The 
second coding region optionally may be covalently joined to one 
or more additional coding regions and the last coding region is 
40 joined to a termination codon and, optionally, appropriate 3' 
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regulatory regions (e.g., polyadenylat ion signals) . The 
presenilin sequences of the fusion protein may represent the 
first, second, or any additional coding regions. The presenilin 
sequences may be conserved or non-conserved domains and can be 
5 placed in any coding region of the fusion. The non-presenilin 
sequences of the fusion may be chosen according to the needs and 
discretion of the practitioner and are not limited by the present 
invention. Useful non-presenilin sequences include, however 
short sequence "tags" such as antigenic determinants or poly-His 
10 tags which may be used to aid in the identification or 

purification of the resultant fusion protein. Alternatively, the 
non-presenilin coding region may encode a large protein or 
protein fragment, such as an enzyme or binding protein which also 
may assist in the identification and purification of the protein 
or which may be useful in an assay such as those described below' 
Particularly contemplated presenilin fusion proteins include 
poly-His and GST (glutathione S- transferase) fusions which are 
useful in isolating and purifying the presenilins, and the yeast 
two hybrid fusions, described below, which are useful in assays 
to identify other proteins which bind to or interact with the 
presenilins. 

In another series of embodiments, the present invention 
provides isolated nucleic acids in the form of recombinant DNA 
constructs in which a marker or reporter gene (e.g., 
galactosidase. lucif erase) is operably joined to the 5' 
regulatory region of a presenilin gene such that expression of 
the marker gene is under the control of the presenilin regulatory 
sequences. Using the presenilin regulatory regions disclosed or 
otherwise enabled herein, including regulatory regions from PSl 
and PS2 genes from human and other mammalian species, one of 
ordinary skill in the art is now enabled to produce such 
constructs. As discussed more fully below, such isolated nucleic 
acids may be used to produce cells, cell lines or transgenic 
animals which are useful in the identification of compounds which 
can, directly or indirectly, differentially affect the expression 
of the presenilins. 

Finally, the isolated nucleic acids of the present invention 
include any of the above described sequences when included in 
vectors. Appropriate vectors include cloning vectors and 
expression vectors of an types, including plasmids, phagemids. 
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cosraids, episomes, and the like, as well as integration vectors. 
The vectors may also include various marker genes (e.g., 
antibiotic resistance or susceptibility genes) which are useful 
in identifying cells successfully transformed therewith. In 
5 addition, the vectors may include regulatory sequences to which 
the nucleic acids of the invention are operably joined, and/or 
may also include coding regions such that the nucleic acids of 
the invention, when appropriately ligated into the vector, are 
expressed as fusion proteins. Such vectors may also include 
10 vectors for use in yeast "two hybrid," baculovirus, and phage- 
display systems. The vectors may be chosen to be useful for 
prokaryotic, eukaryotic or viral expression, as needed or desired 
for the particular application. For example, vaccinia virus 
vectors or simian virus vectors with the SV40 promoter (e.g., 
15 pSV2), or Herpes simplex virus or adeno-associated virus may be 
useful for transfection of mammalian cells including neurons in 
culture or in vjvo, and the baculovirus vectors may be used in 
transfecting insect cells (e.g., butterfly cells) . A great 
variety of different vectors are now commercially available and 
otherwise known in the art, and the choice of an appropriate 
vector is within the ability and discretion of one of ordinary 
skill in the art. 
2. Substantially Pure Pr»t^finff 

The present invention provides for substantially pure 
25 preparations of the presenilin proteins, fragments of the 
presenilin proteins, and fusion proteins including the 
presenilins or fragments thereof. The proteins, fragments and 
fusions have utility, as described herein, in the generation of 
antibodies to normal and mutant presenilins, in the 
identification of presenilin binding proteins, and in diagnostic 
and therapeutic methods. Therefore, depending upon the intended 
use, the present invention provides substantially pure proteins 
or peptides comprising amino acid sequences which are 
subsequences of the complete presenilin proteins and which may 
have lengths varying from 4-10 amino acids (e.g., for use as 
immunogens) , or 10-100 amino jacids (e.g. , for use in binding 
assays), to the complete presenilin proteins. Thus, the present 
invention provides substantially pure proteins or peptides 
comprising sequences corresponding to at least 4-5, preferably 6- 
10, and more preferably at least 50 or, 100 consecutive amino 
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acids of the presenilin proteins, as disclosed or otherwise 
enabled herein. 

The proteins or peptides of the invention may be isolated 
and purified by any of a variety of methods selected on the basis 
5 of the properties revealed by their protein sequences. Because 
the presenilins possess properties of integral or membrane - 
spanning proteins, a membrane fraction of cells in which the 
presenilin is normally highly expressed (e.g., neurons, 
oligodendroglia, muscle, pancreas) may be isolated and the 
10 proteins extracted by, for example, detergent solubilization. 

Alternatively the presenilin protein, fusion protein, or fragment 
thereof, may be purified from cells transformed or transfected 
with expression vectors (e.g., baculovirus systems such as the 
pPbac and pMbac vectors (Stratagene, La Jolla, CA) ; yeast 
15 expression systems such as the pYESHIS Xpress vectors 

(Invitrogen, San Diego, CA) ; eukaryotic expression systems such 
as PCDNA3 (Invitrogen, San Diego, CA) which has constant 
constitutive expression, or LacSwitch (Stratagene, La Jolla, CA) 
which is inducible; or prokaryotic expression vectors such as 
20 PKK233-3 (Clontech, Palo Alto, CA) . In the event that the 

protein or fragment integrates into the endoplasmic reticulum or 
plasma membrane of the recombinant cells (e.g., immortalized 
human cell lines or other eukaryotic cells) , the protein may be 
purified from the membrane fraction. Alternatively, if the 
protein is not properly localized or aggregates in inclusion 
bodies within the recombinant cells (e.g., prokaryotic cells), 
the protein may be purified from whole lysed cells or from 
solubilized inclusion bodies. 

Purification can be achieved using standard protein 
purification procedures including, but not limited to, gel- 
filtration chromatography, ion-exchange chromatography, high- 
performance liquid chromatography (RP-HPLC, ion-exchange HPLC, 
size-exclusion HPLC, high-performance chromatof ocusing 
chromatography, hydrophobic interaction chromatography, 
35 inmunoprecipitation, or innnunoaffinity purification. Gel 
. electrophoresis (e.g., PAGE, sps- PAGE) can also be .used to 
isolate a protein or peptide based on its molecular weight, 
charge properties and hydrophobicity. 

A presenilin protein, or a fragment thereof, may also be 
conveniently purified by creating a fusion protein including the . 
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desired presenilin sequence fused to another peptide such as an 
antigenic determinant or poly-His tag (e.g.. QlAexpress vectors, 
QIAGEN Corp., chatsworth, CA) , or a larger protein (e.g., GST 
using the pGEX-27 vector (Amrad, USA) or green fluorescent 
5 protein using the Green Lantern vector (GIBCO/BRL. Gaithersburg, 
MD) . The fusion protein may be expressed and recovered from 
prokaryotic or eukaryotic cells and purified by any standard 
method based upon the fusion vector sequence. For example, the 
fusion protein may be purified by immunoaff inity or 
10 immunoprecipitation with an antibody to the non-presenilin 
portion of the fusion or, in the case of a poly-His tag. by 
affinity binding to a nickel column. The desired presenilin 
protein or fragment can then be further purified from the fusion 
protein by enzymatic cleavage of the fusion protein. Methods for 
15 preparing and using such fusion constructs for the purification 
of proteins are well known in the art and several kits are now 
commercially available for this purpose, in light of the present 
disclosure, one is now enabled to en?)loy such fusion constructs 
with the presenilins. 
20 3. Antibodies to the Pri.R^n^1|po 

The present invention also provides antibodies, and methods 
of making antibodies, which selectively bind to the presenilin 
proteins or fragments thereof. Of particular importance, by 
identifying the functional domains of the presenilins and the 
25 polymorphic regions associated with AD, the present invention 

provides antibodies, and methods of making antibodies, which will 
selectively bind to and, thereby, identify and/or distinguish 
normal and mutant (i.e., pathogenic) forms of the presenilin 
proteins. The antibodies of the invention have utility as 
30 laboratory reagents for. ^ntgr a^jja, immunoaff inity purification 
of the presenilins. Western blotting to identify cells or tissues 
expressing the presenilins, and iramunocytochemistry or 
immunofluorescence techniques to establish the subcellular 
location of the protein, in addition, as described below, the 
J5 antibodies of the invention may be used as diagnostics tools to 
identify carriers of AD-related presenilin alleles, as 
therapeutic tools to selectively bind and inhibit pathogenic 
forms of the presenilin proteins in vivo . 

The antibodies of the invention may be generated using the 
10 entire presenilin proteins of the invention or using any 
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presenilin epitope which is characteristic of that protein and 
which sxibstantially distinguishes it from other host proteins. 
Such epitopes may be identified by comparing sequences of, for 
example. 4-10 amino acid residues from a presenilin sequence to 
5 computer databases of protein sequences from the relevant host. 
Preferably, the epitopes are chosen from the N- and C- termini, or 
from the loop domains which connect the transmembrane domains of 
the proteins. In particular, antibodies to the polymorphic N- 
terminal region, TMl-2 loop, or TM6-7 loop are expected to have 
10 the greatest utility both diagnostically and therapeutically. On 
the other hand, antibodies against highly conserved domains are 
expected to have the greatest utility for purification or 
identification of presenilins. 

Using the IBI Pustell program, amino acid residue positions 
15 were identified as potential antigenic sites in the hPSi protein 
and may be useful in generating the antibodies of the invention. 
These positions, corresponding to positions in SEQ ID NO: 2, are 
listed in Table 6. 

Other methods of choosing antigenic determinants may, of 
20 course, are known in the art and be employed. In addition, 
larger fragments (e.g.. 8-20 or, preferably, 9-15 residues) 
including some of these epitopes may also be employed. For 
example, a fragment including the 109-112 epitope may comprise 
residues 107-114. or 105-116. Even larger fragments, including 
25 for example entire functional domains or multiple function 

domains (e.g.. TMl, TMl-»2, and TM2 or TM6, TM6-.7, and TM7) may 
also be preferred. For other presenilin proteins (e.g., for mPSl 
or other non-human homologues, or for PS2> , homologous sites may 
be chosen. 

30 using the same IBI Pustell program, amino acid residue 

positions were identified as potential antigenic sites in the 
hPS2 protein and may be useful in generating the antibodies of 
the invention. These positions, corresponding to positions in 
SEQ ID NO: 19, are listed in Table 7. 

35 AS for PSl, other methods of choosing antigenic determinants 

. may, of course, are known in. the art and be employed. In 
addition, larger fragments (e.g., 8-20 or, preferably, 9-15 
residues) including some of these epitopes may also be employed. 
For example, a fragment including the 310-314 epitope may 

40 comprise residues 308-316, or 307-317. Even larger fragments. 
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including for example entire functional domains or multiple 
function domains (e.g., TMl, TMl-2, and TM2 or TM6, TM6-7, and 
TM7) may also be preferred. For other presenilin proteins (e.g.. 
for mPS2 or other non-human homologues, or for PSl) . homologous 
5 sites may be chosen. 

Presenilin immunogen preparations may be produced from crude 
extracts (e.g., membrane fractions of cells highly expressing the 
proteins) , from proteins or peptides substantially purified from 
cells which naturally or recombinantly express them or, for short 
10 immunogens, by chemical peptide synthesis. The presenilin 

iramunogens may also be in the form of a fusion protein in which 
the non-presenilin region is chosen for its adjuvant properties. 
As used herein, a presenilin immtmogen shall be defined as a 
preparation including a peptide comprising at least 4-8, and 
preferably at least 9-15 consecutive amino acid residues of the 
presenilin proteins, as disclosed or otherwise enabled herein. 
Sequences of fewer residues may, of course, also have utility 
depending upon the intended use and future technological 
developments. Therefore, any presenilin derived sequences which 
are employed to generate antibodies to the presenilins should be 
regarded as presenilin immunogens. 

The antibodies of the invention may be polyclonal or 
monoclonal, or may be antibody fragments, including Fab 
fragments, F(ab'),, and single chain antibody fragments. In 
addition, after identifying useful antibodies by the method of 
the invention, recombinant antibodies may be generated, including 
any of the antibody fragments listed above, as well as humanized 
antibodies based upon non-human antibodies to the presenilin 
proteins, in light of the present disclosures of presenilin 
30 proteins, as well as the characterization of other presenilins 
enabled herein, one of ordinary skill in the art may produce the 
above-described antibodies by any of a variety of standard means 
well known in the art. For an overview of antibody techniques, 
see Antibody Enaineerinrrr a Pr;.r^iT -f ^i r rVi-l" Borrebaek. ed. , W.H 
Freeman & Company, NY (1992) , or Antibody Bn^^no^T-Inq 2hd Ed. , 
Borrebaek, ed., Oxford University Press, Oxford (1995) . 

As a general matter, polyclonal antibodies may be generated 
by first immunizing a mouse, rabbit, goat or other suitable 
animal with the presenilin immunogen in a suitable carrier. To 
increase the immunogenicity of the preparation, the immunogen may. 
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be coupled to a carrier protein or mixed with an adjuvant (e.g., 
Freund's adjuvant). Booster injections, although not necessary 
are recommended. After an appropriate period to allow for the 
development of a humoral response, preferably several weeks, the 
5 animals may be bled and the sera may be purified to isolate the 
immunoglobulin component. 

Similarly, as a general matter, monoclonal anti-presenilin 
antibodies may be produced by first injecting a mouse, rabbit, 
goat or other suitable animal with a presenilin immunogen in a 
10 suitable carrier. As above, carrier proteins or adjuvants may be 
utilized and booster injections (e.g., bi- or tri-weekly over 8- 
10 weeks) are recommended. After allowing for development of a 
humoral response, the animals are sacrificed and their spleens 
are removed and resuspended in, for example, phosphate buffered 
15 saline (PBS). The spleen cells serve as a source of lymphocytes, 
some of which are producing antibody of the appropriate 
specificity. These cells are then fused with an immortalized 
cell line (e.g., myeloma), and the products of the fusion are 
plated into a number of tissue culture wells in the presence of a 
20 selective agent such as HAT. The wells are serially screened and 
replated. each time selecting cells making useful antibody. 
Typically, several screening and replating procedures are carried 
out until over 90% of the wells contain single clones which are 
positive for antibody production. Monoclonal antibodies produced 
25 by such clones may be purified by standard methods such as 
affinity chromatography using Protein A Sepharose, by ion- 
exchange chromatography, or by variations and combinations of 
these techni<jues. 

The antibodies of the invention may be labelled or 
30 conjugated with other compounds or materials for diagnostic 
and/or therapeutic uses. For example, they may be coupled to 
radionuclides, fluorescent compounds, or enzymes for imaging or 
therapy, or to liposomes for the targeting of compounds contained 
in the liposomes to a specific tissue location. 
35 4.. Transformed Cell Lin^H 

The present invention also provides for cells or cell lines, 
both prokaryotic and eukaryotic, which have been transformed or 
transfected with the nucleic acids of the present invention so as 
to cause clonal propagation of those nucleic acids and/or 
expression of the proteins or peptides encoded thereby/ Such 
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cells or cell lines will have utility both in the propagation and 
production of the nucleic acids and proteins of the present 
invention but also, as further described herein, as model systems 
for diagnostic and therapeutic assays. As used herein, the term 
"transformed cell" is intended to embrace any cell, or the 
descendant of any cell, into which has been introduced any of the 
nucleic acids of the invention, whether by transformation, 
transfection, infection, or other means. Methods of producing 
appropriate vectors, transforming cells with those vectors, and 
identifying transformante are well known in the art and are only 
briefly reviewed here (see, for example, Sambrook et al. (1989) 
MP i ^ecular Cloning; a Laboratnr-v 2nd ed. , Cold Spring 

Harbor Laboratory Press. Cold Spring Harbor, New York) . 

Prokaryotic cells useful for producing the transformed cells 
of the invention include members of the bacterial genera 
figCheriqh^^ (e.g.. g, ggU ) , PggtiqpmgnaP (e.g.. P. aerucrinns^ ^ 
and Bagil^yg (e.g., B. gubti^l^m, B. 8tearothP rmophilu« ) . as well 
as many others well known and frequently used in the art. 
Prokaryotic cells are particularly useful for the production of 
large quantities of the proteins or peptides of the invention 
(e.g., normal or mutant presenilins, fragments of the 
presenilins. fusion proteins of the presenilins). Bacterial 
cells (e.g., E, ggW may be used with a variety of expression 
vector systems including, for example, plasmids with the T7 rna 
polymerase/promoter system, bacteriophage X regulatory sequences, 
or M13 Phage mGPI-2. Bacterial hosts may also be transformed 
with fusion protein vectors which create, for example. lacZ. 
trpE. maltose-binding protein. poly-His tags, or glutathione -S- 
transf erase fusion proteins. All of these, as well as many other 
30 prokaryotic expression systems, are well known in the art and 

widely available commercially (e.g., pGEX-27 (Amrad, USA) for GST 
fusions) . 

Eukaryotic cells and cell lines useful for producing the 
transformed cells of the invention include mammalian cells and 
35 cell lines (e.g., PC12, COS. CHO. fibroblasts, myelomas, 

ijeuroblastpmas. hybridpmae, human embryonic kidney 293, oocytes, 
embryonic stem cells), insect cells lines (e.g., using 
baculovirus vectors such as pPbac or pMbac (Stratagene, La Jolla, 
CA)), yeast (e.g., using yeast expression vectors such as pYESHIS 
(Invitrogen. CA) ) , and fungi. Eukaryotic cells are particularly 
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useful for embodiments in which it is necessary that the 
presenilin proteins, or functional fragments thereof, perform the 
functions and/or imdergo the intracellular interactions 
associated with either the normal or mutant proteins. Thus, for 
5 example, transformed eukaryotic cells are preferred for use as 
models of presenilin function or interaction, and assays for 
screening candidate therapeutics preferably employ transformed 
eukaryotic cells. 

To accomplish expression in eukaryotic cells, a wide variety 

10 of vectors have been developed and are commercially available 
which allow inducible (e.g., LacSwitch expression vectors, 
Stratagene, La Jolla, CA) or cognate (e.g., pcDNAB vectors, 
Invitrogen, Chatsworth, CA) expression of presenilin nucleotide 
sequences xinder the regulation of an artificial promoter element. 

15 Such promoter elements are often derived from CMV or SV40 viral 
genes, although other strong promoter elements which are active 
in eukaryotic cells can also be employed to induce transcription 
of presenilin nucleotide sequences. Typically, these vectors 
also contain an artificial polyadenylation sequence and 3' UTR 

20 which can also be derived from exogenous viral gene sequences or 
from other eukaryotic genes. Furthermore, in some constructs, 
artificial, non-coding, spliceable introns and exons are included 
in the vector to enhance expression of the nucleotide sequence of 
interest (in this case, presenilin sequences) . These expression 

25 systems are commonly available from commercial sources and are 
typified by vectors such as pcDNA3 and pZeoSV (Invitrogen, San 
Diego, CA) , Both of the latter vectors have been successfully 
used to cause expression of presenilin proteins in transfected 
COS, CHO, and PC12 cells (Levesque et al. 1996). Innumerable 

30 commercially-available as well as custom-designed expression 

vectors are available from commercial sources to allow expression 
of any desired presenilin transcript in more or less any desired 
cell type, either const itutively or after exposure to a certain 
exogenous stimulus (e.g., withdrawal of tetracycline or exposure 

35 to IPTG) . 

Vectors may be introduced into the. recipient or ''host" cells 
by various methods well known in the art including, but not 
limited to, calcium phosphate transf ection, strontium phosphate 
transfection, DEAE dextran transfection, elect roporat ion, 
40 lipofection (e.g. , Dosper Liposomal transfection reagent, 
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Boehringer Mannheim, Germany) , microinjection, ballistic 
insertion on micro-beads, protoplast fusion or, for viral or 
phage vectors, by infection with the recombinant virus or phage. 
5. Transgenic Animal Model r 
5 The present invention also provides for the production of 

transgenic non-human animal models for the study of Alzheimer's 
Disease, for the screening of candidate pharmaceutical compounds, 
for the creation of explanted mammalian CNS cell cultures (e.g., 
neuronal, glial, organotypic or mixed cell cultures) in which 
mutant or wild type presenilin sequences are expressed or in 
which the presenilin genes has been inactivated {e.g., »laioc}c- 
out" deletions), and for the evaluation of potential therapeutic 
interventions. Prior to the present invention, a partial animal 
model for Alzheimer's Disease existed via the insertion and over- 
expression of a mutant form of the human amyloid precursor 
protein gene as a minigene under the regulation of the platelet - 
derived growth factor 0 receptor promoter element (Games et al., 
1995) . This mutant (/SAPP,„ Val.»ile) causes the appearance of 
synaptic pathology and amyloid 0 peptide deposition in the brain 
of transgenic animals bearing this transgene in high copy number. 
These changes in the brain of the transgenic animal are very 
similar to that seen in human AD (Games et al., 1995). it is, 
however, as yet unclear whether these animals become demented,' 
but there is general consensus that it is now possible to 
recreate at least some aspects of AD in mice. 

Animal species which suitable for use in the animal models 
of the present invention include, but are not limited to. rats, 
mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, 
pigs, and non-human primates (e.g.. Rhesus monkeys, chimpanzees). 
30 For initial studies, transgenic rodents (e.g., mice) are 

preferred due to their relative ease of maintenance and" shorter 
life spans. Indeed, as noted above, transgenic yeast or 
invertebrates (e.g., nematodes, insects) may be preferred for 
some studies because they will allow for even more rapid and 
35 inexpensive screening. Transgenic non- human primates, however, 
.^« .:P^«^«»^'^e'i longer term studies due to their greater 
similarity to humans and their higher cognitive abilities. 

Using the nucleic acids disclosed and otherwise enabled 
herein, there are now several available approaches for the 
creation of a transgenic animal model for Alzheimer's Disease. 
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Thus, the enabled animal models include (1) Animals in which a 
normal human presenilin gene has been recombinantly introduced 
into the genome of the animal as an additional gene, under the 
regulation of either an exogenous or an endogenous promoter 
5 element, and as either a minigene or a large genomic fragment; in 
which a normal human presenilin gene has been recombinantly 
siibstituted for one or both copies of the animal's homologous 
presenilin gene by homologous recombination or gene targeting; 
and/or in which one or both copies of one of the animal's 

10 homologous presenilin genes have been recombinantly "humanized" 
by the partial substitution of sequences encoding the human 
homologue by homologous recombination or gene targeting . These 
animals are useful for evaluating the effects of the transgenic 
procedures, and the effects of the introduction or substitution 

15 of a human or humanized presenilin gene. (2) Animals in which a 
mutant (i.e., pathogenic) human presenilin gene has been 
recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or 
an endogenous promoter element, and as either a minigene or a 

20 large genomic fragment; in which a mutant human presenilin gene 
has been recombinantly substituted for one or both copies of the 
animal's homologous presenilin gene by homologous recombination 
or gene targeting; and/or in which one or both copies of one of 
the animal's homologous presenilin genes have been recombinantly 

25 "humanized" by the partial substitution of sequences encoding a 
mutant human homologue by homologous recombination or gene 
targeting. These animals are useful as models which will display 
some or all of the characteristics, whether at the biochemical, 
physiological and/or behavioral level, of humans carrying one or 

30 more alleles which are pathogenic of Alzheimer's Disease or other 
diseases associated with mutations in the presenilin genes. (3) 
Animals in which a mutant version of one of that euiimal's 
presenilin genes (bearing, for example, a specific mutation 
corresponding to, or similar to, one of the pathogenic mutations 

35 of the human presenilins) has been recombinantly introduced into 
the genome of the animal as an additional gene, under the 
regulation of either an exogenous or an endogenous promoter 
element, and as either a minigene or a large genomic fragment; 
and/or in which a mutant version of one of that auiimal's 

40 presenilin genes (bearing, for exan?>le, a specific mutation 
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corresponding to, or similar to, one of the pathogenic mutations 
of the human presenilins) has been recombinant ly substituted for 
one or both copies of the animal's homologous presenilin gene by 
homologous recombination or gene targeting. These animals are 
5 also useful as models which will display some or all of the 

characteristics, whether at the biochemical, physiological and/or 
behavioral level, of humans carrying one or more alleles which 
are pathogenic of Alzheimer's Disease. (4) "Knock-out" animals 
in which one or both copies of one of the animal's presenilin 
10 genes have been partially or completely deleted by homologous 

recombination or gene targeting, or have been inactivated by the 
insertion or substitution by homologous recombination or gene 
targeting of exogenous sequences (e.g., stop codons, lox p 
sites) , Such animals are useful models to study the effects 
15 which loss of presenilin gene expression may have, to evaluate 

whether loss of function is preferable to continued expression of 
mutant forms, and to examine whether other genes can be recruited 
to replace a mutant presenilin (e.g., substitute PSl with PS2) or 
to intervene with the effects of other genes (e.g., APP or ApoE) 
20 causing AD as a treatment for AD or other disorders. For 

example, a normal presenilin gene may be necessary for the action 
of mutant APP genes to actually be expressed as AD and, 
therefore, transgenic presenilin animal models may be of use in 
elucidating such multigenic interactions. 
2^ create an animal model (e.g., a transgenic mouse), a 

normal or mutant presenilin gene (e.g., normal or mutant hPSl, 
mPSl, hPS2, mPS2, etc.), or a normal or mutant version of a 
recombinant nucleic acid encoding at least a functional domain of 
a presenilin (e.g., a recombinant construct comprising an mPSl 
30 sequence into which has been substituted a nucleotide sequence 
corresponding to a human mutant sequence) can be inserted into a 
germ line or stem cell using standard techniques of oocyte 
microinjection, or transfection or microinjection into embryonic 
stem cells. Animals produced by these or similar processes are 
35 referred to as transgenic. Similarly, if it is desired to 

inactivate or replace an endogenous presenilin gene, homologous 
recombination using embryonic stem cells may be employed. 
Animals produced by these or similar processes are referred to as 
"knock-out" (inactivation) or "knock-in" (replacement) models. 
40 For oocyte injection, one or more copies of the recombinant 
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DNA constructs of the present invention may be inserted into the 
pronucleus of a just-fertilized oocyte. This oocyte is then 
reimplanted into a pseudo-pregnant foster mother. The livebom 
animals are screened for integrants using analysis of DNA (e.g., 
5 from the tail veins of offspring mice) for the presence of the 
inserted recombinant transgene sequences. The transgene may be 
either a complete genomic sequence injected as a YAC, BAG, PAC or 
other chromosome DNA fragment, a cDNA with either the natural 
promoter or a heterologous promoter, or a minigene containing all 

10 of the coding region and other elements foxind to be necessary for 
optimum expression. 

Retroviral infection of early embryos can also be done to 
insert the recombinant DNA constructs of the invention. In this 
method, the transgene (e.g., a normal or mutant hPSi or PS2 

15 sequence) is inserted into a retroviral vector which is used to 
infect embryos (e.g., mouse or non-human primate embryos) 
directly during the early stages of development to generate 
chimeras, some of which will lead to germline transmission. 

Homologous recombination using stem cells allows for the 

20 screening of gene transfer cells to identify the rare homologous 
recombination events. Once identified, these can be used to 
generate chimeras by injection of blastocysts, and a proportion 
of the resulting animals will show germline transmission from the 
recombinant line. This methodology is especially useful if 

25 inactivation of a presenilin gene is desired. For example, 
inactivation of the mPSl gene in mice may be accomplished by 
designing a DNA fragment which contains sequences from an mPSl 
exon flanking a selectable marker. Homologous recombination 
leads to the insertion of the marker sequences in the middle of 

30 an exon, causing inactivation of the mPSl gene and/or deletion of 
internal sequences. DNA analysis of individual clones can then 
be used to recognize the homologous recombination events. 

The techniques of generating transgenic animals, as well as 
the techniques for homologous recombination or gene targeting, 

35 are now widely accepted and practiced. A laboratory manual on 
the manipulation of the mouse embryo, for example, is available 
detailing standard laboratory techniques for the production of 
transgenic mice (Hogan et al., 1986). To create a transgene, the 
target sequence of interest (e.g., mutant or wild- type presenilin 

40 sequences) are typically ligated into a cloning site located 
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downstream of some promoter element which will regulate the 
expression of RNA from the presenilin sequence. Downstream of 
the presenilin sequence, there is typically an artificial 
polyadenylation sequence. In the transgenic models that have 
5 been used to successfully create coiimals which mimic aspects of 
inherited human neurodegenerative diseases, the most successful 
promoter elements have been the platelet-derived growth factor 
receptor /3 gene sxibxinit promoter and the hamster prion protein 
gene promoter, although other promoter elements which direct 

10 expression in central nervous system cells would also be useful. 
An alternate approach to creating a transgene is to use an 
endogenous presenilin promoter and regulatory sequences to drive 
expression of the presenilin transgene. Finally, it is possible 
to create transgenes using large genomic DNA fragments such as 

15 YACs which contain the entire presenilin gene as well as its 
appropriate regulatory sequences. Such constructs have been 
successfully used to drive human APP expression in transgenic 
mice (Lamb et al., 1993). 

Animal models can also be created by targeting the 

20 endogenous presenilin gene in order to alter the endogenous 

presenilin sequence by homologous recombination. These targeting 
events can have the effect of removing endogenous sequence 
(knock-out) or altering the endogenous sec^ience to create an 
amino acid chcuige associated with human disease or an otherwise 

25 abnormal sequence (e.g., a sequence which is more like the human 
sequence than the original animal sequence) (knock-in animal 
models) . A large number vectors are available to accomplish this 
and appropriate sources of genomic DNA for mouse and other animal 
genomes to be targeted are commercially available from companies 

30 such as GenomeSystems Inc. (St. Louis, Missouri, USA). The 

typical feature of these targeting vector constructs is that 2 to 
4 kb of genomic DNA is ligated 5' to a selectable marker (e,g., a 
bacterial neomycin resistance gene under its own promoter element 
termed a ^neomycin cassette"). A second DNA fragment from the 

35 gene of interest is then ligated downstream of the neomycin 
cassette biit upstream of a second eelectabXe marker (e.g. , 
thymidine kinase) . The DNA fragments are chosen such that mutant 
sequences can be introduced into the germ line of the targeted 
animal by homologous replacement of the endogenous sequences by 

40 either one of the . sequences inqluded in the vector. 
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Alternatively, the sequences can be chosen to cause deletion of 
sequences that would normally reside between the left and right 
arras of the vector surrounding the neomycin cassette. The former 
is known as a knock- in, the latter is known as a knock-out. 
Again, innumerable model systems have been created, particularly 
for targeted knock-outs of genes including those relevant to 
neurodegenerative diseases (e.g., targeted deletions of the 
murine APP gene by Zheng et al., 1995; targeted deletion of the 
murine prion gene associated with adult onset human CNS 
degeneration by Bueler et al., 1996). 

Finally, equivalents of transgenic animals, including 
animals with mutated or inactivated presenilin genes, may be 
produced using chemical or x-ray mutagenesis of gametes, followed 
by fertilization. Using the isolated nucleic acids disclosed or 
15 otherwise enabled herein, one of ordinary skill may more rapidly 
screen the resulting offspring by, for example, direct sequencing 
RFLP, PCR, or hybridization analysis to detect mutants, or 
Southern blotting to demonstrate loss of one allele by dosage. 
A?g?Yg for Drugs Which Affect Pr^B^ n ilin Exnr^««^nn 
In another series of embodiments, the present invention 
provides assays for identifying small molecules or other 
compounds which are capable of inducing or inhibiting the 
expression of the presenilin genes and proteins (e.g., PSl or 
PS2) . The assays may be performed in vitro using non- transformed 
cells, immortalized cell lines, or recombinant cell lines, or is 
vivo using the transgenic animal models enabled herein. 

In particular, the assays may detect the presence of 
increased or decreased expression of PSl, PS2 or other 
presenilin-related genes or proteins on the basis of increased or 
decreased mRNA expression (using, e.g., the nucleic acid probes 
disclosed and enabled herein) , increased or decreased levels of 
PSl, PS2 or other presenilin-related protein products (using, 
e.g., the ant i -presenilin antibodies disclosed and enabled 
herein), or increased or decreased levels of expression of a 
marker gene (e.g., ^-galactosidase or lucif erase) operably joined 
to a presenilin 5', regulatory region in a recombinant construct.. 

Thus, for example, one may culture cells known to express a 
particular presenilin and add to the culture medium one or more 
test compounds. After allowing a sufficient period of time 
40 (e.g., 0-72 hours) for the compound to induce or inhibit, the - 



25 



30 



35 



wo 96/94099 



PCT/CA96/00263 



- 65 - 

expression of the presenilin, any change in levels of expression 
from an established baseline may be detected using any of the 
techniques described above and well known in the art. in 
particularly preferred embodiments, the cells are from an 
5 immortalized cell line such as a human neuroblastoma, 

glioblastoma or a hybridoma cell line. Using the nucleic acid 
probes and /or antibodies disclosed and enabled herein, detection 
of changes in the expression of a presenilin, and thus 
identification of the compound as an inducer or repressor of 
10 presenilin expression, requires only routine experimentation. 

In particularly preferred embodiments, a recombinant assay 
is employed in which a reporter gene such a )3-galactosidase, 
green fluorescent protein , alkaline phosphatase, or lucif erase 
is operably joined to the 5' regulatory regions of a presenilin 
15 gene. Preferred vectors include the Green Lantern l vector 

(GIBCO/BRL, Gaithersburg, MD and the Great EScAPe pSEAP vector 
(Clontech, Palo Alto) . The hPSl regulatory regions disclosed 
herein, or other presenilin regulatory regions, may be easily 
isolated and cloned by one of ordinary skill in the art in light 
20 of the present disclosure of the coding regions of these genes. 
The reporter gene and regulatory regions are joined in- frame {or 
in each of the three possible reading frames) so that 
transcription and translation of the reporter gene may proceed 
\mder the control of the presenilin regulatory elements. The 
25 recombinant construct may then be introduced into any appropriate 
cell type although mammalian cells are preferred, and human cells 
are most preferred. The transformed cells may be grown in 
culture and, after establishing the baseline level of expression 
of the reporter gene, test compounds may be added to the medium. 
30 The ease of detection of the expression of the reporter gene 
provides for a rapid, high through-put assay for the 
identification of inducers and repressors of the presenilin gene. 

Con5)p\inds identified by this method will have potential 
utility in modifying the expression of the PSi, PS2 or other 
35 presenilin- related genes in vivo. Th^«^ compounds may be further 
tested in the animal models disclosed and enabled herein to 
identify those compoimds having the most potent in vivo effects. 
In addition, as described herein with respect to small molecules 
having presenilin-binding activity, these molecules may serve as 
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"lead compoimds" for the further development of pharmaceuticals 
by, for example, subjecting the compounds to sequential 
modifications, molecular modeling, and other routine procedures 
employed in rational drug design. 
5 7. Identification of Compou nds with Presenilin BindiTia Capayi^Y 
In light of the present disclosure, one of ordinary skill in 
the art is enabled to practice new screening methodologies which 
will be useful in the identification of proteins and other 
compounds which bind to, or otherwise directly interact with, the 

10 presenilins. The proteins and compounds will include endogenous 
cellular components which interact with the presenilins in vivo 
and which, therefore, provide new targets for pharmaceutical and 
therapeutic interventions, as well as recombinant, synthetic and 
otherwise exogenous compoimds which may have presenilin binding 

15 capacity and, therefore, may be candidates for pharmaceutical 
agents. Thus, in one series of embodiments, cell lysates or 
tissue homogenates {e.g., human brain homogenates, lymphocyte 
lysates) may be screened for proteins or other compounds which 
bind to one of the normal or mutant presenilins. Alternatively, 

20 any of a variety of exogenous compounds, both naturally occurring 
and/or synthetic (e.g., libraries of small molecules or 
peptides) , may be screened for presenilin binding capacity. 
Small molecules are particular preferred in this context because 
they are more readily absorbed after oral administration, have 

25 fewer potential antigenic determinants, and/or are more likely to 
cross the blood brain barrier than larger molecules such as 
nucleic acids or proteins. The methods of the present invention 
are particularly useful in that they may be used to identify 
molecules which selectively or preferentially bind to a mutant 

30 form of a presenilin protein (rather than a normal form) and, 
therefore, may have particular utility in treating the 
heterozygous victims of this dominant autosomal disease. 

Because the normal physiological roles of PSl and PS2 are 
still unknown, compounds which bind to normal, mutant or both 

35 forms of these presenilins may have utility in treatments and 
diagnostics. Comppunds which bind only to a normal presenilin 
may, for example, act as enhauicers of its normal activity and 
thereby at least partially compensate for the lost or abnormal 
activity of mutant forms of the presenilin in Alzheimer's Disease 

40 victims. Comppunds which bind to both normal and mutant forms of 
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a presenilin may have utility if they differentially affect the 
activities of the two forms so as to alleviate the overall 
departure from normal function. Alternatively, blocking the 
activity of both normal and mutant forms of either PSl or PS2 may 
5 have less severe physiological and clinical consequences than the 
normal progress of the disease and, therefore, compounds which 
bind to and inhibit the activity of both normal and mutant forms 
of a presenilin may be therapeutically useful. Preferably, 
however, compounds are identified which have a higher affinity of 
10 binding to mutant presenilin than to normal presenilin <e.g., at 
least 2-10 fold higher K.) and which selectively or preferentially 
inhibit the activity of the mutant form. Such compoxinds may be 
identified by using any of the techniques described herein and by 
then comparing the binding affinities of the candidate 
15 compound(s) for the normal and mutant forms of PSl or PS2 . 

The effect of agents which bind to the presenilins (normal 
or mutant forms) can be monitored either by the direct monitoring 
of this binding using instniments (e.g., BIAcore, LKB Pharmacia, 
Sweden) to detect this binding by, for example, a change in 
20 fluorescence, molecular weight, or concentration of either the 
binding agent or presenilin component, either in a solxible phase 
or in a substrate -bound phase. 

Once identified by the methods described above, the 
candidate compounds may then be produced in quantities sufficient 
25 for pharmaceutical administration or testing (e.g., fig or mg or 
greater quantities), and formulated in a pharmaceutical ly 
acceptable carrier (see, e.g.. Remington ' s Pharmaceut i ra } 
Sfiienogs, Gennaro, A., ed, , Mack Pub., 1990). These candidate 
compounds may then be administered to the transformed cells of 
3 0 the invention, to the transgenic animal models of the invention, 
to cell lines derived from the animal models or from human 
patients, or to Alzheimer's patients. The animal models 
described and enabled herein are of particular utility in further 
testing candidate compounds which bind to normal or mutant 
35 presenilin for their therapeutic efficacy. 

In addition, once identified by the methods described ^Oaove, 
the candidate compounds may also serve as ''lead compounds* in the 
design and development of new pharmaceuticals. For exatt^le, as 
in well known in the art, sequential modification of small 
40 molecules (e.g. , amino acid residue replacement with peptides; 
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functional group replacement with peptide or non-peptide 
compounds) is a standard approach in the pharmaceutical industry 
for the development of new pharmaceuticals. Such development 
generally proceeds from a *lead compound" which is shown to have 
5 at least some of the activity (e.g., PSl binding or blocking 

ability) of the desired pharmaceutical. In particular, when one 
or more compounds having at least some activity of interest 
(e.g., modulation of presenilin activity) are identified, 
structural comparison of the molecules can greatly inform the 

10 skilled practitioner by suggesting portions of the lead compounds 
which should be conserved and portions which may be varied in the 
design of new candidate compoxinds. Thus, the present invention 
also provides a means of identifying lead compounds which may be 
sequentially modified to produce new candidate compounds for use 

15 in the treatment of Alzheimer's Disease. These new compounds 
then may be tested both for presenilin-binding or blocking 
(e.g., in the binding assays described above) and for therapeutic 
efficacy (e.g., in the animal models described herein). This 
procedure may be iterated until compoiinds having the desired 

20 therapeutic activity and/or efficacy are identified. 

In each of the present series of embodiments, an assay is 
conducted to detect binding between a "presenilin con^onent" and 
some other moiety. Of particular utility will be sequential 
assays in which compoxinds are tested for the ability to bind to 

25 only the normal or only the mutant forms of the presenilin 

functional domains using mutant and normal presenilin components 
in the binding assays. Such compoxinds are expected to have the 
greatest therapeutic utilities, as described more fully below. 
The •'presenilin component" in these assays may be a complete 

30 normal or mutant form of a presenilin protein (e.g., an hPSl or 
hPS2 variant) but need not be. Rather, particular functional 
domains of the presenilins, as described above, may be employed 
either as separate molecules or as part of a fusion protein. For 
example, to isolate proteins or compounds that interact with 

35 thiese f\anctional domains, screening may be carried out using 
fusion constructs and/or synthetic peptides corresponding to 
these regions. Thus, for PS2, GST- fusion peptides may be made 
including sequences corresponding approximately to amino acids i 
to 87 (N-terrainus) , or 269-387 {TM6->7 loop) , or to any other 

40 conserved domain of intefrest. For shorter ftmctional domains, a 
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synthetic peptide may be produced corresponding, for example, 
approximately to amino acids 107 to 134 (TMl-»2 loop) . Similarly, 
for PSl, GST- or other fusion peptides may be produced including 
sequences corresponding approximately to amino acids 1 to 81 (N- 
5 terminus) or 266 to 410 (TM6->7 loop) or a synthetic peptide may 
be produced corresponding approximately to amino acids 101 to 131 
{TMl->2 loop) . Obviously, various combinations of fusion proteins 
and presenilin functional domains are possible and these are 
merely examples- In addition, the functional domains may be 

10 altered so as to aid in the assay by, for example, introducing 

into the f\inctional domain a reactive group or amino acid residue 
(e.g., cysteine) which will facilitate immobilization of the 
domain on a sxibstrate (e.g., using sulfhydryl reactions). Thus, 
for example, the PSl TMl-»2 loop fragment {31-mer) has been 

15 synthesized containing an additional C-terminal cysteine residue. 
This peptide will be used to create an affinity substrate for 
affinity chromatography (Sulfo-link; Pierce) to isolate binding 
proteins for microsequencing. Similarly, other fxinctional domain 
or antigenic fragments may be created with modified residues 

20 (see, e.g.. Example 10). 

The proteins or other compounds identified by these methods 
may be purified and characterized by any of the standard methods 
known in the art. Proteins may, for exan5>le, be purified and 
separated using electrophoretic (e.g,, SDS-PAGE, 2D PAGE) or 

25 chromatographic (e.g., HPLC) techniques and may then be 

microsequenced. For proteins with a blocked N-terminus, cleavage 
(e.g., by CNBr and/or trypsin) of the particular binding protein 
is used to release peptide fragments. Further 

purification/characterization by HPLC and microsequencing and/or 
30 mass spectrometry by conventional methods provides internal 
sequence data on such blocked proteins. For non-protein 
compoiinds, standard organic chemical analysis techniques (e.g., 
IR, NMR and mass spectrometry; fxinctional group amalysis; X-ray 
crystallography) may be employed to determine their structure and 
35 identity. 

Methods for screening cellular lysates, tissue homogenates, 
or small molecule libraries for candidate presehilin-hinding 
molecules are well known in the art and, in light of the present 
disclosure, may now be employed to identify compoxmds which bind 
40 to normal or mutant presenilin con5>onents or which modulate 
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presenilin activity as defined by non-specific measures (e.g., 
changes in intracellular Ca'\ GTP/GDP ratio) or by specific 
measures (e.g., changes in A|3 peptide production or changes in 
the expression of other dovmstream genes which can be monitored 
5 by differential display, 2D gel electrophoresis, differential 
hybridization, or SAGE methods) . The preferred methods involve 
variations on the following techniques: (l) direct extraction 
by affinity chromatography; (2) co- isolation of presenilin 
components and bound proteins or other compoiuids by 
10 immunoprecipitation; (3) the Biomolecular Interaction Assay 
(BIAcore) ; and (4) the yeast two-hybrid systems. These and 
others are discussed separately below. 
A. Affinity Chromatooranhv 

In light of the present disclosure, a variety of affinity 
15 binding techniques well Jcnown in the art may be employed to 

isolate proteins or other compounds which bind to the presenilins 
disclosed or otherwise enabled herein. In general, a presenilin 
component may be immobilized on a substrate (e.g., a column or 
filter) and a solution including the test compo\ind(s) is 
20 contacted with the presenilin protein, fusion or fragment xonder 
conditions which are permissive for binding. The substrate is 
then washed with a solution to remove unbound or weakly bound 
molecules. A second wash may then elute those compounds which 
strongly bound to the immobilized normal or mutant presenilin 
25 component. Alternatively, the test compounds may be immobilized 
and a solution containing one or more presenilin components may 
be contacted with the column, filter or other substrate. The 
ability of the presenilin component to bind to the test confounds 
may be determined as above or a labeled form of the presenilin 
30 con5)onent (e.g., a radio-labeled or chemi luminescent functional 
domain) may be used to more rapidly assess binding to the 
substrate- immobilized compound (s) . In addition, as both PSl and 
PS2 are believed to be membrane associated proteins, it may be 
preferred that the presenilin proteins, fusion or fragments be 
35 incorporated into lipid bilayers (e.g. . liposomes) to promote 
their proper folding, This is particularly true when a. 
presenilin component including at least one transmembrane domain 
is employed. Such presenilin -liposomes may be immobilized on 
substrates (either directly or by means of another element in the 
40 liposome membrane), passed oyer substrates with immobilized test 
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compounds, or used in any of a variety of other well knovm 
binding assays for membrane proteins. Alternatively, the 
presenilin component may be isolated in a membrane fraction from 
cells producing the component, and this membrane fraction may be 
used in the binding assay. 
B. Co-Immunoprecinit^ario^ 

Another well characterized technique for the isolation of 
the presenilin components and their associated proteins or other 
compounds is direct immunoprecipitation with antibodies. This 
procedure has been successfully used, for example, to isolate 
many of the synaptic vesicle associated proteins (Phizicky and 
Fields, 1994). Thus, either normal or mutant, free or membrane- 
bound presenilin components may be mixed in a solution with the 
candidate compound (s) under conditions which are permissive for 
binding, and the presenilin component may be immunoprecipitated. 
Proteins or other compounds which co-immunoprecipitate with the 
presenilin component may then be identified by standard 
techniques as described above. General techniques for 
immunoprecipitation may be found in, for example, Harlow and 
Lane, (1988) Ant i ^frffdigg; A Laboratory Manual cold Spring Harbor 
Press, Cold Spring Harbor, NY. 

The antibodies employed in this assay, as described and 
enabled herein, may be polyclonal or monoclonal, and include the 
various antibody fragments (e.g.. Fab, P(ab')„) as well as single 
25 chain antibodies, and the like. 

The Biomolecular I nteracting ft fffV 
Another useful method for the detection and isolation of 
binding proteins is the Biomolecular Interaction Assay or 
"BIAcore" system developed by Pharmacia Biosensor and described 
30 in the manufacturer's protocol (LKB Pharmacia, Sweden). In light 
of the present disclosure, one of ordinary skill in the art is 
now enabled to employ this system, or a substantial equivalent, 
to identify proteins or other compounds having presenilin binding 
capacity. The BIAcore system uses an affinity purified anti-GST 
35 antibody to immobilize GST-fusion proteins onto a sensor chip. 

Obviously, other. fusion proteins and corresponding antibodies may 
be substituted. The sensor utilizes surface plasmon resonance 
which is an optical phenomenon that detects changes in refractive 
indices. A homogenate of a tissue of interest is passed over the 
40 immobilized fusion protein and protein-protein interactions are 
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registered as changes in the refractive index. This system can 
be used to determine the kinetics of binding and to assess 
whether any observed binding is of physiological relevance. 

D. The Yeast Two-Hvbr id Svttt.t>m 

5 The yeast "two-hybrid" system takes advantage of 

transcriptional factors that are composed of two physically 
separable, functional domains (Phizicky and Fields, 1994). The 
most commonly used is the yeast GAL4 transcriptional activator 
consisting of a DNA binding domain and a transcriptional 

10 activation domain. Two different cloning vectors are used to 
generate separate fusions of the GAL4 domains to genes encoding 
potential binding proteins. The fusion proteins are co- 
expressed, targeted to the nucleus and, if interactions occur, 
activation of a reporter gene (e.g., lacZ) produces a detectable 

15 phenotype. For example, the Clontech Matchmaker System- 2 may be 
used with the Clontech brain cDNA GAL4 activation domain fusion 
library with presenilin-GAL4 binding domain fusion clones 
(Clontech, Palo Alto, CA) . in light of the disclosures herein, 
one of ordinary skill in the art is now enabled to produce a 

20 variety of presenilin fusions, including fusions including either 
normal or mutant functional domains of the presenilin proteins, 
and to screen such fusion libraries in order to identify 
presenilin binding proteins. 

E. Other Methods 

25 The nucleotide sequences and protein products, including 

both mutant and normal forms of these nucleic acids and their 
corresponding proteins, can be used with the above techniques to 
isolate other interacting proteins, and to identify other genes 
whose expression is altered by the over-expression of normal 

30 presenilin sequences, by the under-ei^ression of normal 

presenilins sequences, or by the expression of mutant presenilin 
sequences. Identification of these interacting proteins, as well 
as the identification of other genes whose expression levels are 
altered in the fa;ce of mutant presenilin sequences (for instance) 

35 will identify other gene targets which have direct relevance to 
the pathogenesis of this,, disease in its clinical or; pathological 
forms. Specifically, other genes will be identified which may 
themselves be the site of other mutations causing Alzheimer's 
Disease, or which can themselves be targeted therapeutically 

40 (e.g., to reduce their expression levels to normal or to 
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pharmacologically block the effects of their over-eaqjression) as 
a potential treatment for this disease. Specifically, these 
techniques rely on PCR-based and/or hybridization-based methods 
to identify genes which are differentially expressed between two 
5 conditions (a cell line expressing normal presenilins compared to 
the same cell type expressing a mutant presenilin sequence) . 
These techniques include differential display, serial analysis of 
gene expression (SAGE) , and mass -spectrometry of protein 2D-gels 
and subtractive hybridization (reviewed in Nowak, 1995 and Kahn, 
10 1995) . 

As will be obvious to one of ordinary skill in the art, 

there are numerous other methods of screening individual proteins 

or other compounds, as well as large libraries of proteins or 

other compounds (e.g., phage display libraries and cloning 

systems from Stratagene, La Jolla, CA) to identify molecules 

which bind to normal or mutant presenilin components. All of 

these methods comprise the step of mixing a normal or mutant 

presenilin protein, fusion, or fragment with test con^jounds, 

allowing for binding (if any), and assaying for botind complexes. 

All such methods are now enabled by the present disclosure of 

substantially pure presenilins, substantially pure presenilin 

functional domain fragments, presenilin fusion proteins, 

presenilin antibodies, and methods of making and using the same. 

Hgtb9dg T^m^^Yim ComPOUndB Modulating P r esenilin 
25 Activity 

In another series of embodiments, the present invention 
provides for methods of identifying compounds with the ability to 
modulate the activity of normal and mutant presenilins. As used 
with respect to this series of embodiments, the term 6activity6 
broadly includes gene and protein expression, presenilin protein 
post- translation processing, trafficking and localization, and 
any functional activity (e.g., enzymatic, receptor-effector, 
binding, channel), as well as downstream affects of any of these. 
The presenilins appear to be integral membrane proteins normally 
associated with the endoplasmic reticulum and/or Golgi apparatus 
and may have functions involved in the ,tj:an8port or traff icking 
of APP and/or the regulation of intracellular calcium levels. In 
addition, it is known that presenilin mutations are associated 
with the increased production of A/3 peptides, the appearance of 
amyloid plaques and neurofibrillary tangles, decreases in 
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cognitive function, and apoptotic cell death. Therefore, using 
the transformed cells and treuisgenic animal models of the present 
invention, cells obtained from subjects bearing a mutant 
presenilin gene, or animals or human subjects bearing naturally 
5 occurring presenilin mutations, it is now possible to screen 
candidate pharmaceuticals and treatments for their therapeutic 
effects by detecting changes in one or more of these functional 
characteristics or phenotypic manifestations of normal or mutant 
presenilin expression. 

10 Thus, the present invention provides methods for screening 

or assaying for proteins, small molecules or other compounds 
which modulate presenilin activity by contacting a cell in vivo 
or in vitro with a candidate compound and assaying for a change 
in a marker associated with normal or mutant presenilin activity. 

15 The marker associated with presenilin activity may be any 
measurable biochemical, physiological, histological and/or 
behavioral characteristic associated with presenilin expression. 
In particular, useful markers will include any measurable 
biochemical, physiological, histological and/or behavioral 

20 characteristic which distinguishes cells, tissues, animals or 
individuals bearing at least one mutant presenilin gene from 
their nonnal counterparts. In addition, the marker may be any 
specific or non-specific measure of presenilin activity. 
Presenilin specific measures include measures of presenilin 

25 expression (e.g., presenilin mRNA or protein levels) which may 
employ the nucleic acid probes or antibodies of the present 
invention. Non-specific measures include changes in cell 
physiology such as pH, intracellular calcium, cyclic AMP levels, 
GTP/GDP ratios, phosphatidylinositol activity, protein 

30 phosphorylation, etc., which cam be monitored on devices such as 
the cytosensor microphysiometer (Molecular Devices Inc., United 
States) . The activation or inhibition of presenilin activity in 
its mutant or normal form can also be monitored by examining 
changes in the expression of other genes which are specific to 

35 the presenilin pathway leading to Alzheimer's Disease. These can 
be assc^yed by such techniques as differential display, 
differential hybridization, and SAGE (sequential analysis of 
gene expression) , as well as by two dimensional gel 
electrophoresis of cellular lysates. In each case, the 

40 differentially-expressed genes can be asqertainc^d by inspection 
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of identical studies before and after application of the 
candidate compound. Furthermore, as noted elsewhere, the 
particular genes whose expression is modulated by the 
administration of the candidate compound can be ascertained by 
5 cloning, nucleotide sequencing, amino acid sequencing, or mass 
spectrometry (reviewed in Nowak, 1995) , 

In general, a cell may be contacted with a candidate 
compound and, after an appropriate period (e.g,, 0-72 hours for 
most biochemical measures of cultured cells), the marker of 

10 presenilin activity may be assayed euid compared to a baseline 
measurement. The baseline measurement may be made prior to 
contacting the cell with the candidate compound or may be an 
external baseline established by other experiments or known in 
the art. The cell may be a transformed cell of the present 

15 invention or an explant from an animal or individual. In 
particular, the cell may be an explant from a carrier of a 
presenilin mutation (e.g., a human subject with AlzheimerOs 
Disease) or an animal model of the invention (e.g., a transgenic 
nematode or mouse bearing a mutant presenilin gene) . To augment 

20 the effect of presenilin mutations on the A/3 pathway, transgenic 
cells or animals may be eTi5>loyed which have increased A)8 
production. Preferred cells include those from neurological 
tissues such as neuronal, glial or mixed cell cultures; auid 
cultured fibroblasts, liver, kidney, spleen, or bone marrow. The 

25 cells may be contacted with the candidate compoxands in a culture 
in vitro or may be administered in vivo to a live animal or human 
sxibject. For live animals or human subjects, the test compound 
may be administered orally or by any parenteral route suitable to 
the compound. For clinical trials of human subjects, 

30 measurements may be conducted periodically (e.g., daily, weekly 
or monthly) for several months or years. 

Because most carriers of presenilin mutations will be 
heterozygous (i.e., bearing one normal and one mutant presenilin 
allele), compounds may be tested for their ability to modulate 

35 normal, as well as presenilin activity. Thus, for example, 

compounds which enhance the function of normal presenilins may . 
have utility in treating presenilin associated disorders such as 
AlzheimerOs Disease. Alternatively, because suppression of the 
activity of both normal and mutant presenilins in a heterozygous 

40 individual may have less severe clinical consequences than 
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progression of the associated disease, it may be desired to 
identify compound which inactivate or suppress all forms of the 
presenilins. Preferably, however, compounds are identified which 
selectively or specifically inactivate or suppress the activity 
5 of a mutant presenilin without disrupting the function of a 
normal presenilin gene or protein- 

In light of the identification, characterization, and 
disclosure herein of the presenilin genes and proteins, the 
presenilin nucleic acid probes and antibodies, and the presenilin 
10 transformed cells and transgenic animals of the invention, one of 
ordinary skill in the art is now enabled by perform a great 
variety of assays which will detect the modulation of presenilin 
activity by candidate compounds. Particularly preferred and 
contemplated embodiments are discussed in some detail below. 
15 A. Presenilin Expression 

In one series of embodiments, specific measures of 
presenilin expression are employed to screen candidate compounds 
for their ability to affect presenilin activity. Thus, using the 
presenilin nucleic acids and antibodies disclosed and otherwise 
20 enabled herein, one may use mRNA levels or protein levels as a 
marker for the ability of a candidate compound to modulate 
presenilin activity. The use of such probes and antibodies to 
measure gene and protein expression is well known in the art and 
discussed elsewhere herein. Of particular interest may be the 
25 identification of compoxinds which can alter the relative levels 
of different splice variants of the presenilins. Many of the 
presenilin mutations associated with AlzheimerOs Disease, for 
example, are located in the region of the putative TM6->7 loop 
which is subject to alternative splicing in some peripheral 
30 tissues (e.g., white blood cells). Compoxinds which can increase 
the relative frequency of this splicing event may, therefore, be 
effective in preventing the expression of mutations in this 
region. 

B. Intracellular Localization 

35 In another series of embodiments, compounds may be screened 

for their ability to iriodulate the activity of the presenilins 
based upon their effects on the trafficking and intracellular 
localization of the presenilins. The presenilins have been seen 
immtinocytochemically to be localized in membrane structures 

40 associated with the encioplasmic reticulum and Golgi apparatus, 
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and one presenilin mutant (H163R) , but not others, has been 
visualized in small cytoplasmic vesicles of imknown fiinction* 
Differences in localization of mutant and normal presenilins may, 
therefore, contribute to the etiology of presenilin-related 
5 diseases. Compoimds which can affect the localization of the 
presenilins may, therefore, be identified as potential 
therapeutics. Standard techniques known in the art may be 
employed to detect the localization of the presenilins. 
Generally, these techniques will employ the antibodies of the 

10 present invention, and in particular antibodies which selectively 
bind to one or more mutant presenilins but not to normal 
presenilins. As is well known in the art, such auitibodies may be 
labeled by any of a variety of techniques (e.g., fluorescent or 
radioactive tags, labeled secondary antibodies, avidin-biotin, 

15 etc.) to aid in visualizing the intracellular location of the 
presenilins. The presenilins may be co-localized to particular 
structures, as in known in the art, using amtibodies to markers 
of those structures (e.g., TGN38 for the Golgi, transferrin 
receptor for post-Golgi transport vesicles, IiAMP2 for lysosomes) . 

20 Western blots of purified fractions from cell lysates enriched 
for different intracellular membrane botmd organelles (e.g., 
lysosomes, synaptosomes , Golgi) may also be en^loyed. In 
addition, the relative orientation of different domains of the 
presenilins across cellular domains may be assayed using, for 

25 example, electron microscopy and antibodies raised to those 
domains . 

B. Ion Regulation/Metabolism 

In another series of embodiments, compounds may be screened 
for their ability to modulate the activity of the presenilins 

30 based upon measures in intracellular Ca^\ Na* or levels or 
metabolism. As noted above, the presenilins are membrcuie 
associated proteins which may serve as, or interact with, ion 
receptors or ion channels. Thus, compounds may be screened for 
their ability to modulate presenilin-related calcium or other ion 

35 metabolism either in vivo or in vitro bv measurements of ion 
channel fluxes and/or transmembrane voltage or current fluxes 
using patch clsunp, voltage clamp and fluorescent dyes sensitive 
to intracellular calcitim or transmembrame voltage. Ion channel 
or receptor fxinctipn can also be assayed by measurements of 

40 activation of second messengers such as cyclic AMP, cGMP tyrosine 
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kinases, phosphates, increases in intracellular Ca»* levels, etc. 
Recombinantly made proteins may also be reconstructed in 
artificial membrane systems to study ion channel conductance and, 
therefore, the 6cell6 employed in such assays may comprise an 
5 artificial membrane or cell. Assays for changes in ion 

regulation or metabolism can be performed on cultured cells 
expressing endogenous normal or mutant presenilins. Such studies 
also can be performed on cells transfected with vectors capable 
of expressing one of the presenilins, or functional domains of 

10 one of the presenilins, in normal or mutant form. In addition, 
the enhance the signal measured in such assays, cells may be co- 
transfected with genes encoding ion channel proteins. For 
example, jC^nppus oocytes or rat kidney (HEK293) cells may be co- 
transfected with normal or mutant presenilin sequences and 

15 sequences encoding rat brain Na* pi sxibunits, rabbit skeletal 

muscle Ca'* pi subunits, or rat heart r /Si subunits. Changes in 
presenilin-related or presenilin-raediated ion cheumel activity 
can be measured by two-microelectrode voltage -clamp recordings in 
oocytes or by whole-cell patch-clamp recordings in HEK293 cells. 

20 C. APOPtosis or Cell D^ath 

In another series of embodiments, compounds may be screened 
for their ability to modulate the activity of the presenilins 
based upon their effects on presenilin-related or presenilin- 
mediated apoptosis or cell death. Thus, for example, baseline 
25 rates of apoptosis or cell death may be established for cells in 
culture, or the baseline degree of neuronal loss at a particular 
age may be established post-mortem for animal models or human 
subjects, and the ability of a candidate compound to suppress or 
inhibit apoptosis or cell death may be measured. Cell death may 
30 be measured by standard microscopic techniques (e.g., light 
microscopy) or apoptosis may be measured more specifically by 
characteristic nuclear morphologies or DNA fragmentation patterns 
which create nucleosomal ladders (see, e.g., Gavrieli et al., 
1992; Jacobson et al., 1993; Vito et al., 1996). TtniEL may also 
35 be employed to evaluate cell death in brain (see, e.g., Lassmann 
et al. , 1995) . In preferred embodiments, coiqpounds are screened . 
for their ability to suppress or inhibit neuronal loss in the 
transgenic animal models of the invention. Transgenic mice 
bearing, for exanple, a mutant huraam, mutant mouse, or humanized 
mutant presenilin gene may be employed to identify or evaluate 
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compounds which may delay or arrest the neurodegeneration 
associated with AlzheimerOs Disease. A similar transgenic mouse 
model, bearing a mutant APP gene, has recently been reported by 
Games et al. (1995) . 
5 D. Afl Pepti de Product ion 

In another series of embodiments, compounds may be screened 
for their ability to modulate presenilin- related or presenilin- 
mediated changes in APP processing. The A/8 peptide is produced 
in several isoforms resulting from differences in APP processing. 
The A^ peptide is a 39 to 43 amino acid derivative of /SAPP which 
is progressively deposited in diffuse and senile plagues and in 
blood vessels of subjects with AD. in human brain, A/J peptides 
are heterogeneous at both the N- and C-termini. Several 
observations, however, suggest that both the full length and N- 
terminal truncated forms of the long- tailed A^ peptides ending at 
residue 42 or 43 (i.e., Aj8l-42/43 and A^x-42/43) have a more 
important role in AD than do peptides ending at residue 40. 
Thus, A^i-42/43 and A^x-42/43 are an early and prominent feature 
of both senile plaques and diffuse plaques, while peptides ending 
20 at residue 40 (i.e., A^l-40 and A/5x-40) are predominantly 

associated with a subset of mature plaques and with amyloidotic 
blood vessels (see, e.g., Iwatsubo et al., 1995; Gravina et al., 
1995; Tamaoka et al., 1995; Podlisny et al. 1995). Furthermore, 
the long-tailed isoforms have a greater propensity to fibril 
formation, and are thought to be more neurotoxic than A/31-40 
peptides (Pike et al., 1993; Hilbich et al., 1991). Finally, 
missense mutations at codon 717 of the ^APP gene associated with 
early onset FAD result in overproduction of long-tailed A/? in the 
brain of affected mutation carriers, in peripheral cells and 
plasma of both affected and presymptomatic carriers, and in cell 
lines transfected with ^APP„, mutant cDNAs (Tamaoka et al., 1994; 
Suzuki et al., 1994) As described in Example 18 below, we' now 
disclose that increased production of the long- forms of the A/3 
peptide are also associated with mutations in the presenilin 
35 'genes.. 

Thus, in one series of embodiments , the prewt. invention - 
provides methods for screening candidate compounds for their 
ability to block or inhibit the increased production of long 
isoforms of the A^ peptides in cells or transgenic animals 
40 expressing a mutant presenilin gene, m particular, the ptresent 



25 



30 



CI IDC'I'1 1 1 m rueer rnt ti f 



wo 96/34099 



PCT/CA96/00263 



- 80 - 

invention provides such methods in which cultured mannnalian 
cells, such as brain cells or fibroblasts, have been transformed 
according to the methods disclosed herein, or in which transgenic 
animals, such as rodents or non-human primates, have been 
5 produced by the methods disclosed herein, to express relatively 
high levels of a mutant presenilin. Optionally, such cells or 
transgenic animals may also be transformed so as to express a 
normal form of the /3APP protein at relatively high levels. 

In this series of embodiments, the candidate compound is 
10 administered to the cell line or transgenic animals (e.g., by 
addition to the media of cells in culture; or by oral or 
parenteral administration to an animal) and, after an appropriate 
period (e.g., 0-72 hours for cells in culture, days or months for 
animal models), a biological sample is collected (e.g., cell 
15 culture supernatant or cell lysate from cells in culture; tissue 
homogenate or plasma from an animal) and tested for the level of 
the long isoforms of the AP peptides. The levels of the peptides 
may be determined in an absolute sense (e.g., nMol/ml) or in a 
relative sense (e.g., ratio of long to short isoforms). The 
20 A/5 isoforms may be detected by any means Imown in the art (e.g., 
electrophoretic separation and sequencing) but, preferably, 
antibodies which are specific to the long isoform are employed to 
determine the absolute or relative levels of the A/51-42/43 or 
A^x-42/43 peptides. Candidate pharmaceuticals or therapies which 
25 reduce the absolute or relative levels of these long AjS isoforms, 
particularly in the transgenic animal models of the invention, 
are likely to have therapeutic utility in the treatment of 
Alzheimer's Disease, or other disorders caused by presenilin 
mutations or aberrations in APP metabolism. 
30 E. PhggPhPrvlation of Microtubul e Assorianed Prot^P-i^ B 

In another series of embodiments, candidate compounds may be 
screened for their ability to modulate presenilin activity by 
assessing the effect of the compound on levels of phosphorylation 
of microt\ibule associated proteins (MAPs) such as Tau. The 
35 abnormal phosphorylation of Tau and other MAPs in the brains of 
victims of Alzheimerps Disease is well. }aiown in the art. Thus., 
compounds which prevent or inhibit the abnormal phosphorylation 
of MAPS may have utility in treating presenilin associated 
diseases such as AD. As above, cells from normal or mutant 
40 animals. or subjects, or the transformed cell lines and animal 
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models of the invention may be en5)loyed. Preferred assays will 
employ cell lines or animal models transformed with a mutant 
human or humanized mutant presenilin gene. The baseline 
phosphorylation state of MAPs in these cells may be estGQ>lished 
5 and then Ccuididate compo\inds may be tested for their ability to 
prevent, inhibit or cotinteract the hyperphosphorylation 
associated with mutauits- The phosphorylation state of the MAPs 
may be determined by any standard method known in the art but, 
preferably, antibodies which bind selectively to phosphorylated 
10 or unphosphorylated epitopes are employed- Such antibodies to 
phosphorylation epitopes of the Tau protein are known in the art 
(e.g. , ALZ50) . 

9. Screening and Diagnostics for Alzheimer^ s Disease 
A. General Diagnostic Methods 

15 The presenilin genes and gene products, as well as the 

presenilin-derived probes, primers and antibodies, disclosed or 
otherwise enabled herein, are useful in the screening for 
carriers of alleles associated with Alzheimer's Disease, for 
diagnosis of victims of Alzheimer's Disease, and for the 

20 screening and diagnosis of related presenile and senile 
dementias, psychiatric diseases such as schizophrenia and 
depression, and neurologic diseases such as stroke and cerebral 
hemorrhage, all of which are seen to a greater or lesser extent 
in syn5>tomatic human subjects bearing mutations in the PSl or PS2 

25 genes or in the APP gene. Individuals at risk for Alzheimer's 

Disease, such as those with AD present in the family pedigree, or 
individuals not previously known to be at risk, may be routinely 
screened using probes to detect the presence of a mutant 
presenilin gene or protein by a variety of techniques. Diagnosis 

30 of inherited cases of these diseases can be accomplished by 
methods based upon the nucleic acids (including genomic and 
mRNA/cDNA sequences), proteins, and/or antibodies disclosed and 
enabled herein, including functional assays designed to detect 
failure or augmentation of the normal presenilin activity and/or 

35 the presence of specific new activities conferred by the mutant 
presenilins. PreferaOjly, the methods and products are based upon 
the human PSl or PS2 nucleic acids, proteins or antibodies, as 
disclosed or otherwise enabled herein. As will be obvious to one 
of ordinary skill in the art, however, the significant 

40 evolutionary conservation of large portions of the PSl and PS2 
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nucleotide and amino acid sequences, even in species as diverse 
as humans, mice, C. eleaana. and Drosophila . allow the skilled 
artisan to make use of such non-human presenilin-homologue 
nucleic acids, proteins and antibodies, even for applications 
5 directed toward human or other animal subjects. Thus, for 
brevity of exposition, but without limiting the scope of the 
invention, the following description will focus upon uses of the 
human horaologues of PSl and PS2. It will be vmderstood, however, 
that homologous sequences from other species, including those 
10 disclosed herein, will be equivalent for many purposes. 

As will be appreciated by one of ordinary skill in the art, 
the choice of diagnostic methods of the present invention will be 
influenced by the nature of the available biological samples to 
be tested and the nature of the information required. PSl. for 

15 example, is highly expressed in brain tissue but brain biopsies 
are invasive and expensive procedures, particularly for routine 
screening. Other tissues which express PSl at significant 
levels, however, may demonstrate alternative splicing (e.g., 
lymphocytes) and, therefore, PSl mRNA or protein from such cells 

20 may be less informative. Thus, an assay based upon a subject's 
genomic PSl DNA may be the preferred because no information will 
be dependent upon alternative splicing and because essentially 
any nucleate cells may provide a usable sample. Diagnostics 
based upon other presenilins (e.g., hPS2, mPSl) are stibject to 

25 similar considerations: availability of tissues, levels of 

expression in various tissues, and alternative mRNA and protein 
products resulting from alternative splicing. 
B. Protein Based Screens and Diaemn^^j^p 

When a diagnostic assay is to be based upon presenilin 

30 proteins, a variety of approaches are possible. For example, 
diagnosis can be achieved by monitoring differences in the 
electrophoretic mobility of normal and mutant proteins. Such an 
approach will be particularly useful in identifying mutants in 
which charge substitutions are present, or in which insertions, 

35 deletions or substitutions have -resulted in a significant change 
in the electrophoretic migration of the. resultant protein. 
Alternatively, diagnosis may be based upon differences in the 
proteolytic cleavage patterns of normal- and mutant proteins, 
differences in molar ratios of the various amino acid residues, 

40 or by functional asetays demonstrating altered function of the 
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gene products. 

In preferred embodiments, protein-based diagnostics will 
employ differences in the ability of antibodies to bind to normal 
and mutamt presenilin proteins (especially hPSl or hPS2) . Such 
5 diagnostic tests may employ antibodies which bind to the normal 
proteins but not to mutant proteins, or vice versa. In 
particular, an assay in which a plurality of monoclonal 
antibodies, each capable of binding to a mutant epitope, may be 
en5>loyed. The levels of anti -mutant antibody binding in a sample 
10 obtained from a test subject (visualized by, for exair^jle, 

radiolabelling, ELISA or chemiluminescence) may be compared to 
the levels of binding to a control sample. Alternatively, 
antibodies which bind to normal but not mutant presenilins may be 
employed, and decreases in the level of antibody binding may be 

15 used to distinguish homozygous normal individuals from mutant 
heterozygotes or homozygotes. Such antibody diagnostics may be 
used for in si^u immvmohistochemistry using biopsy sauries of CNS 
tissues obtained anteraortem or postmortem, including 
neuropathological structures associated with these diseases such 

20 as neurofibrillary tangles and amyloid plaques, or may be used 
with fluid samples such a cerebrospinal fluid or with peripheral 
tissues such as white blood cells. 
C- Nucleic Acid Based Sereftna an d DiacmoshirB 

When the diagnostic assay is to be based upon nucleic acids 

25 from a sample, the assay may be based upon mRNA, cDNA or genomic 
DMA. When mRNA is used from a san5>le, many of the same 
considerations apply with respect to source tissues and the 
possibility of alternative splicing. That is, there may be 
little or no expression of transcripts unless appropriate tissue 

30 sources are chosen or available, and alternative splicing may 
result in the loss of some information or difficulty in 
interpretation. However, we have already shown (Sherrington et 
al., 1995; Rogaev, 1995) that mutations in the 5' OTR, 3' UTR, 
open reading frame and splice sites of both PSl and PS2 can 

35 reliably be identified in mRNA/cDNA isolated from white blood 

cells and/or skin fibroblasts. Whether mRNA, cDNA or genomic DNA 
is assayed, standard methods well known in the art may be used to 
detect the presence of a particular sequence either in situ or 
Sltca (see, e.g., Sambrook et al . , (1969) Molecular Cloning ; A 
t^flfrgy^tgyy MftT^W^f 2nd ed. , Cold Spring Harbor Press, Cold Spring 
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Harbor, NY) . As a general matter, however, cuiy tissue with 
nucleated cells may be examined. 

Genomic DNA used for the diagnosis may be obtained from body 
cells, such as those present in the blood, tissue biopsy, 
5 surgical specimen, or autopsy material. The DNA may be isolated 
and used directly for detection of a specific sequence or may be 
amplified by the polymerase chain reaction (PGR) prior to 
analysis. Similarly, RNA or cDNA may also be used, with or 
without PGR amplification. To detect a specific nucleic acid 
10 sequence, direct nucleotide sequencing, hybridization using 

specific oligonucleotides, restriction enzyme digest and mapping, 
PGR mapping, RNase protection, chemical mismatch cleavage, 
ligase-mediated detection, and various other methods may be 
employed. Oligonucleotides specific to particular sequences can 
15 be chemically synthesized and labeled radioactively or non- 
radioactively (e.g., biotin tags, ethidium bromide) , and 
hybridized to individual samples immobilized on membranes or 
other solid- supports (e.g., by dot-blot or transfer from gels 
after electrophoresis), or in solution. The presence or absence 
20 of the target sequences may then be visualized using methods such- 
as autoradiography, fluorometry, or colorimetry. These 
procedures can be automated using redundant, short 
oligonucleotides of known sequence fixed in high density to 
silicon chips. 
25 (1) Appropriate Probea and Prim«»r« 

Whether for hybridization, RNase protection, ligase-mediated 
detection, PGR amplification or any other standards methods 
described herein and well known in the art, a variety of 
subsequences of the presenilin sequences disclosed or otherwise 
30 enabled herein will be useful as probes and/ or primers. These 
sequences or subsequences will include both normal presenilin 
sequences and deleterious mutant sequences. In general, useful 
sequences will include at least 8-9, more preferably 10-50, and 
most preferably 18-24 consecutive nucleotides from the presenilin 
35 ihtrons, exons or intron/exon boundaries. Depending upon the 
targfet sequence, the specificity required, amd future 
technological developments, shorter sequences may also have 
utility. Therefore, any presenilin derived sequence which is 
employed to isolate, clone, amplify, identify or otherwise 
manipulate a presenilin sequence may be regarded as an 
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appropriate probe or primer. Particularly contett^lated as useful 
will be sequences including nucleotide positions from the 
presenilin genes in which disease-causing mutations are known to 
be present, or sequences which flank these positions. 
5 (a) PSl Probes and Primers 

As discussed ahove, a variety of disease- causing mutations 
have now been identified in the humcui PSl gene. Detection of 
these and other PSl mutations is now enabled using isolated 
nucleic acid probes or primers derived from normal or mutant PSl 

10 genes. Particularly contemplated as useful are probes or primers 
derived from sequences encoding the N-terminus, the TM1-TM2 
region, and the TM6-TM7 region. As disclosed above, however, 
mutations have already been detected which affect other regions 
of the PSl protein cuid, using the methods disclosed herein, more 

15 will undoubtedly be detected. Therefore, the present invention 
provides isolated nucleic acid probes and primers corresponding 
to normal and mutcuit sequences from any portion of the PSl gene, 
including introns and 5' and 3' UTRs, which may be shown to be 
associated with the development of Alzheimer's Disease. 

20 Merely as an example, and without limiting the invention, 

probes and primers derived from the hPSl DNA segment immediately 
surrounding the C410Y mutation may be employed in screening and 
diagnostic methods. This mutation arises, at least in some 
individuals, from the substitution of an A for a G at position 

25 1477 of SEQ ID NO: 1. Thus, genomic DNA, mRNA or cDNA acquired 
from peripheral blood samples from an individual can be screened 
using oligonucleotide probes or primers including this 
potentially mutant site. For hybridization probes for this 
mutation, probes of 6-50, and more preferably 18-24 bases 

30 spanning the mutation site (e.g., bp 1467-1487 of SEQ ID NO: l) 

may be employed. If the probe is to be used with mRNA, it should 
of course be complementary to the mRNA (and, therefore, 
correspond to the non-coding streuid of the PSl gene. For probes 
to be used with genomic DNA or cDNA, the probe may be 

35 complementary to either strand. To detect sequences including 
this mutation by PGR methods, appropriate primers would include 
sequences of 8-50, and preferably 18-24, nucleotides in length 
derived from the regions flanking the mutation on either side, 
and which correspond to positions anywhere from l to 1000 bp, but 

40 priBferably 1-200 bp, removed from the site of the mutation. PGR 
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primers which are 5' to the mutation site (on the coding strand) 
should correspond in sequence to the coding strand of the PSl 
gene whereas PGR primers which are 3' to the mutation site (on 
the coding stremd) should correspond to the non-coding or 
5 antisense strand (e.g., a 5' primer corresponding to bp 1451-1468 
of SEQ ID NO: 1 and a 3' primer corresponding to the complement 
of 719-699 of SEQ ID NO: 14) . 

Similar primers may be chosen for other PSl mutations or for 
the mutational "hot spots" in general. For example, a 5' PGR 
10 primer for the M146L mutation (A->C at bp 684) may comprise a 

sequence corresponding to approximately bp 601-620 of SEQ ID NO: 

I and a 3' primer may correspond to the complement of 
approximately bp 1328-1309 of SEQ ID NO: 8, Note that this 
example employs primers from both intronic and exonic sequences. 

15 As another example, an appropriate 5' primer for the A246E 

mutation (C->A at bp 985) may comprise a sequence corresponding to 
approximately bp 907-925 of SEQ ID NO: 1 or a 3 ' primer 
corresponding to the complement of approximately bp 1010-990 of 
SEQ ID NO: 1. As another example, a 5' primer for the H163R 

20 mutation (A-*G at bp 736 of SEQ ID NO: 1 or bp 419 of SEQ ID NO: 
9) comprising a sequence corresponding to approximately bp 354- 
375 of SEQ ID NO: 9 with a 3' primer corresponding to the 
con5>lement of approximately bp 581-559 of SEQ ID NO: 9, 
Similarly, intronic or exonic sequences may be employed, for 

25 example, to produce a 5' primer for the L286V mutation (C->G at bp 
1104 of SEQ ID NO: 1 or bp 398 of SEQ ID NO: 11) comprising a 
sequence corresponding to approximately bp 249-268 of SEQ ID NO: 

II or bp 1020-1039 of SEQ ID NO: 1, and a 3' primer corresponding 
to the complement of approximately bp 510-491 of SEQ ID NO: 11. 

30 It should also be noted that the probes and primers may 

include specific mutated nucleotides. Thus, for example, a 
hybridization probe or 5' primer may be produced for the C410y 
mutation comprising a sequence corresponding to approximately bp 
1468-1486 of SEQ ID NO: 1 to screen for or amplify normal 

35 alleles, or corresponding to the same sequence but with the bp 
corresponding to bp 1477 altered {G-»t) to screen for or an^lify 
mutant alleles, 
(b) PS2 Probes and Primers 

The same general considerations described above with respect 

40 to probes and primers for PSl, apply equally to probes and 
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primers for PS2. In particular, the probes or primers may 
correspond to intron, exon or intron/exon boundary secjuences, may 
correspond to sequences from the coding or non-coding (auitisense) 
strands, and may correspond to normal or mutant sequences. 
5 Merely as examples, the PSl N141I mutation {A-»T at bp 787) 

may be screened for by PGR amplification of the surroxinding DNA 
fragment using a 5' primer corresponding to approximately bp 733- 
751 of SEQ ID NO: 18 and a 3' primer corresponding to the 
complement of approximately bp 846-829 of SEQ ID NO: 18. 

10 Similarly, a 5' primer for the M239V mutation {A-^G at bp 1080) 
may comprise a sequence corresponding to approximately bp 1009- 
1026 auid a 3' primer may correspond to the complement of 
approximately bp 1118-1101 of SEQ ID NO: 18. As another example, 
the sequence encoding the region surrounding the I420T mutation 

15 (T-k: at bp 1624) may be screened for by PGR amplification of 

genomic DNA using a 5' primer corresponding to approximately bp 
1576-1593 of SEQ ID NO: 18 and a 3' primer corresponding to the 
complement of approximately bp 1721-1701 of SEQ ID NO: 18 to 
generate a 146 base pair product. This product may, for example, 

20 then be probed with allele specific oligonucleotides for the 
wild-type (e.g., bp 1616-1632 of SEQ ID NO: 18) and/or mutant 
<e.g., bp 1616-1632 of SEQ ID NO: 18 with T-^ at bp 1624) 
sequences . 

(2) Hybridization Screening 

25 For in situ detection of a normal or mutant PSl, PS2 or 

other presenilin-related nucleic acid sequence, a sample of 
tissue may be prepared by standard techniques and then contacted 
with one or more of the cQjove-described probes, preferably one 
which is labeled to facilitate detection, and an assay for 

30 nucleic acid hybridization is conducted under stringent 

conditions which permit hybridization only between the probe and 
highly or perfectly complementary sequences. Because most of the 
PSl cuid PS2 mutations detected to date consist of a single 
nucleotide substitution, high stringency hybridization conditions 

35 will be required to distinguish normal sequences from most mutant 
sequences. When the presenilin genotypes of the subject's 
parents are )cnown, probes may be chosen accordingly. 
Alternatively, probes to a variety of mutants may be employed 
sequentially or in combination. Because most individuals 

40 carrying presenilin mutants will be heterozygous, probes to 
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noirmal eeguences also may be employed and homozygous normal 
individuals may be distinguished from mutant heterozygotes by the 
amount of binding (e.g., by intensity of radioactive signal). In 
another variation, competitive binding assays may be employed in 
5 which both normal and mutant probes are used but only one is 
labeled. 

(3) Restriction Mapping 

Sequence alterations may also create or destroy fortuitous 
restriction enzyme recognition sites which are revealed by the 

10 use of appropriate enzyme digestion followed by gel -blot 
hybridization. DNA fragments carrying the site (normal or 
mutant) are detected by their increase or reduction in size, or 
by the increase or decrease of corresponding restriction fragment 
numbers. Such restriction fragment length polymorphism analysis 

15 (RFLP) , or restriction mapping, may be employed with genomic DNA, 
mRNA or cDNA. The presenilin sequences may be amplified by PGR 
using the above -described primers prior to restriction, in which 
case the lengths of the PGR products may indicate the presence or 
absence of particular restriction sites, and/or may be subjected 

20 to restriction after amplification. The presenilin fragments may 
be visualized by any convenient meeuis (e.g., under UV light in 
the presence of ethidium bromide) . 

Merely as examples, it is noted that the PSl M146L mutation 
(A-»G at bp 684 of SEQ ID NO: 1) destroys a PsphI site; the H163R 

25 mutation {A-*G at bp 736) destroys an Nlalll site; the A246E 
mutation (G-»A at bp 985) creates a Ddel site; and the L286V 
mutation (G-»G at bp 1104) creates a PvuIII site. One of ordinary 
skill in the art may easily choose from the many commercially 
available restriction enzymes and, based upon the normal cuid 

30 mutsoit sequences disclosed auid otherwise enabled herein, perform 
a restriction mapping analysis which will detect virtually any 
presenilin mutation. 

(4) PGR Mapping 

In another series of embodiments, a single base substitution 
35 mutation may be detected based on differential PGR product length 
or production in PGR. Thus, primers which span mutant sites or 
which, preferably, have 3' termini at mutation sites, may be 
employed to amplify a sample of genomic DNA, mRNA or cDNA from a 
sxibject. A mismatch at a mutational site may be expected to 
40 alter the ability of the normal or mutant , primers to promote the 
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polymerase reaction and, thereby, result in product profiles 
which differ between normal subjects and heterozygous and/or 
homozygous presenilin mutants. The PGR products of the normal 
and mutant gene may be differentially separated and detected by 
5 standard techniques, such as polyacrylaraide or agarose gel 

electrophoresis and visualization with labeled probes, ethidium 
bromide or the like. Because of possible non-specific priming or 
readthrough of mutation sites, as well as the fact that most 
carriers of mutant alleles will be heterozygous, the power of 
10 this technique may be low. 

(5) Elect ronhoret i r Mnh;i ;i -i ^ y 

Genetic testing based on DNA sequence differences also may 
be achieved by detection of alterations in electrophoretic 
mobility of DNA, mRNA or cDNA fragments in gels. Small sequence 

15 deletions and insertions, for example, can be visualized by high 
resolution gel electrophoresis of single or double stranded DNA, 
or as changes in the migration pattern of DNA heteroduplexes in 
non-denaturing gel electrophoresis. Presenilin mutations or 
polymorphisms may also be detected by methods which exploit 

20 mobility shifts due to single-stranded conformational 

polymorphisms (SSCP) associated with mRNA or single-stranded DNA 
secondary structures. 

(6) Chemical cleavage of Mlfinla^r^ftQ 

Mutations in the presenilins may also be detected by 

25 employing the chemical cleavage of mismatch (CCM) method (see, 

e.g., Saleeba and Cotton, 1993, and references therein). In this 
technique, probes (up to - i kb) may be mixed with a sample of 
genomic DNA, cDNA or mRNA obtained from a subject. The sample 
and probes are mixed and subjected to conditions which allow for 

30 heteroduplex formation (if any) . Preferably, both the probe and 
sample nucleic acids are double-stranded, or the probe and sample 
may be PGR amplified together, to ensure creation of all possible 
mismatch heteroduplexes. Mismatched T residues are reactive to 
osmium tetroxide and mismatched C residues are reactive to 

35 hydroxylamine. Because each mismatched A will be accompanied by 
a mismatched T, and each raisraatchesd G will be ac:con5>ani;ed. by a 
mismatched C, any nucleotide differences between the probe and 
sample (including small insertions or deletions) will lead to the 
formation of at least one reactive heteroduplex. After treatment 

40 with osmium tetroxide and/or hydroxylamine to modify any mismatch 
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sites, the mixture is siibjected to chemical cleavage at any 
modified mismatch sites by, for example, reaction with 
piperidine. The mixture may then be analyzed by standard 
techniques such as gel electrophoresis to detect cleavage 
5 products which would indicate mismatches between the probe and 
sample. 

(7) 0^;h?r f1et;)i9(^g 

Various other methods of detecting presenilin mutations, 
based upon the presenilin sequences disclosed and otherwise 

10 enabled herein, will be apparent to those of ordinary skill in 
the art. Any of these may be employed in accordance with the 
present invention. These include, but are not limited to, 
nuclease protection assays (SI or ligase-mediated) , ligated PGR, 
denaturing gradient gel electrophoresis (DGGE; see, e.g., Fischer 

15 and Lerman, 1983), restriction endonuclease fingerprinting 

combined with SSCP (REF-SSCP; see, e.g., Liu and Sommer, 1995), 
auid the like. 

Other Screens an d DiacmQatigs 
In inherited cases, as the primary event, and in non- 
20 inherited cases as a secondary event due to the disease state, 
abnormal processing of PSl, PS2, APP, or proteins reacting with 
PSl, PS2, or APP may occur. This can be detected as abnormal 
phosphorylation, glycosylation, glycation amidation or 
proteolytic cleavage products in body tissues or fluids (e.g., 

25 CSF or blood) . 

Diagnosis also can be made by observation of alterations in 
presenilin transcription, translation, and post-translational 
modification and processing as well as alterations in the 
intracellular and extracellular trafficking of presenilin gene 

30 products in the brain and peripheral cells. Such changes will 
include alterations in the amount of presenilin messenger RNA 
and/ or protein, alteration in phosphorylation state, abnormal 
intracellular location/distribution, abnormal extracellular 
distribution, etc. Such assays will include: Northern Blots 

35 (with presenilin- specific and non-specific nucleotide probes) , 
Western blots ^uid enzyme-linked immunosorbent; assays (ELISA) 
(with antibodies raised specifically to a presenilin or 
presenilin functional domain, including various post- 
translational modification states including glycosylated and 

40 phosphorylated isoforms) . These assays can be performed on 
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peripheral tissues (e.g., blood cells, plasma, cultured or other 
fibroblast tissues, etc.) as well as on biopsies of CNS tissues 
obtained antemortera or postmortem, and upon cerebrospinal fluid. 
Such assays might also include in situ hybridization and 
5 immunohistochemistry (to localize messenger RNA and protein to 
specific subcellular compartments and/or within neuropathological 
structures associated with these diseases such as neurofibrillary 
tangles and amyloid plagues) . 
E. Screening and Diag nostic Kits 

10 In accordance with the present invention, diagnostic kits 

are also provided which will include the reagents necessary for 
the above -described diagnostic screens. For example, kits may be 
provided which include antibodies or sets of antibodies which are 
specific to one or more mutant epitopes. These amtibodies may, 

15 in particular, be labeled by any of the standard means which 

facilitate visualization of binding. Alternatively, kits may be 
provided in which oligonucleotide probes or PGR primers, as 
described above, are present for the detection and/or 
amplification of mutant PSl, PS2 or other presenilin-related 

20 nucleotide sequences. Again, such probes may be labeled for 
easier detection of specific hybridization. As appropriate to 
the various diagnostic embodiments described above, the 
oligonucleotide probes or antibodies in such kits may be 
immobilized to substrates and appropriate controls may be 

25 provided. 

10. Methods of Treatment 

The present invention now provides a basis for therapeutic 
intervention in diseases which are caused, or which may be 
caused, by mutations in the presenilins. As detailed above, 

30 mutations in the hPSl and hPS2 genes have been associated with 
the development of early onset forms of Alzheimer's Disease and, 
therefore, the present invention is particularly directed to the 
treatment of subjects diagnosed with, or at risk of developing, 
Alzheimer's Disease. In view of the expression of the PSl and 

35 PS2 genes in a variety of tissues, however, it is quite likely 

that the effects of mutations at these loci are not restricted to 
the brain and, therefore, may be causative of disorders in 
addition to Alzheimer's Disease. Therefore, the present 
invention is also directed at diseases manifest in other tissues 

40 which may arise from mutations, mis -express ion, mis -metabolism or 
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other inherited or acquired alterations in the presenilin genes 
and gene products. In addition, although Alzheimer's Disease 
manifests as a neurological disorder, this manifestation may be 
caused by mutations in the presenilins which first affect other 
5 organ tissues (e.g., liver), which then release factors which 
affect brain activity, and ultimately cause Alzheimer's Disease. 
Hence, in considering the various therapies described below, it 
is understood that such therapies may be targeted at tissue other 
than the brain, such as heart, placenta, lung, liver, skeletal 
muscle, kidney and pancreas, where PSl and/or PS2 are also 
expressed. 

Without being bound to any particular theory of the 
invention, the effect of the Alzheimer's Disease related 
mutations in the presenilins appears to be a gain of a novel 
function, or an acceleration of a normal function, which directly 
or indirectly causes aberrant processing of the Amyloid Precursor 
Protein (APP) into A^ peptide, abnormal phosphorylation 
homeostasis, and/ or abnormal apoptosis in the brain. Such a gain 
of function or acceleration of function model would be consistent 
with the adult onset of the symptoms and the dominant inheritance 
of Alzheimer's Disease. Nonetheless, the mechanism by which 
mutations in the presenilins may cause these effects remains 
unknown. 

It is known that APP may be metabolized through either of 
25 two pathways. In the first, APP is metabolized by passage 
through the Golgi network and then to secretory pathways via 
clathrin-coated vesicles. Mature APP is then passaged to the 
plasma membrane where it is cleaved by a-secretase to produce a 
soluble fraction (Protease Nexin II) plus a non-amyloidogenic C- 
30 terminal peptide (Selkoe et al., 1995; Gandy et al., 1993). 
Alternatively, mature APP can be directed to the endosome- 
lysosome pathway where it undergoes /S and T-secretase cleavage to 
produce the A/S peptides. The A/S peptide derivatives of APP are 
neurotoxic (Selkoe et al., 1994). The phosphorylation state of 
35 the cell determines the relative balance between the o-secretase 
(non-amyloidogenic) or JWJ pathways (amylpidogenic pathway) (Gandy 
et al. 1993), and can be modified pharmacologically by phorbol 
esters, muscarinic agonists and other agents. The 
phosphorylation state of the cell appears to be mediated by 
cytosolic factors (especially protein kinase C) acting upon one 
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or more integral membrane proteins in the Golgi network. 

Without being bound to any particular theory of the 
invention, the presenilins, in particular hPSl or hPS2 (which 
carry several phosphorylation consensus sequences for protein 
5 kinase C] , may be the integral membrane proteins whose 

phosphorylation state determines the relative balance between the 
a-secretase and A/3 pathways. Thus, mutations in the PSl or PS2 
genes may cause alterations in the structure and function of 
their products leading to defective interactions with regulatory 

10 elements (e.g., protein kinase C) or with APP, thereby promoting 
APP to be directed to the amyloidogenic endosome-lysosome 
pathway. Environmental factors (e.g., viruses, toxins, or aging) 
may also have similar effects on PSl or PS2. 

Again without being bound to any particular theory of the 

15 invention, it is also noted that both the PSl and PS2 proteins 
have sxibstantial amino acid sequence homology to hximan ion 
channel proteins and receptors. For instance, the PS2 protein 
shows substantial homology to the humcui sodium channel a-s\ibunit 
(E-0.18, P-0.16, identities = 22-27% over two regions of at least 

20 35 amino acid residues) using the BLASTP paradigm of Altschul et 
al. (1990). Other diseases (such as malignant hyperthermia and 
hyperkalemic periodic paralysis in humans, and the degeneration 
of mechanosensory neurons in C. eleaans ) arise through mutations 
in ion channels or receptor proteins. Mutation of the PSl or PS2 

25 gene could, therefore, affect similar functions and lead to 
Alzheimer's Disease and/or other psychiatric and neurological 
diseases. 

Therapies to treat presenilin-associated diseases such as AD 
may be based upon (1) administration of normal PSl or PS2 

30 proteins, (2) gene therapy with normal PSl or PS2 genes to 
compensate for or replace the mutant genes, (3) gene therapy 
based upon antisense sequences to mutant PSl or PS2 genes or 
which "knock-out" the mutant genes, (4) gene therapy based upon 
sequences which encode a protein which blocks or corrects the 

35 deleterious effects of PSl or PS2 mutants, (5) immunotherapy 
based upon antibodies to normal and/or mutauit PSl or PS2 
proteins, or (6) small molecules (drugs) which alter PSi or PS 2 
expression, block abnormal interactions between mutant forms of 
PSl or PS2 and other proteins or ligands, or which otherwise 

40 block the aberrant function of mutant PSl or PS 2 proteins by 
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altering the structure of the mutant proteins, by enhancing their 
metabolic clearance, or by inhibiting their function. 

A. Protej.^ T^grWY 

Treatment of presenilin-related Alzheimer's Disease, or 
5 other disorders resulting from presenilin mutations, may be 

performed by replacing the mutant protein with normal protein, by 
modulating the function of the mutant protein, or by providing an 
excess of normal protein to reduce the effect of any aberrant 
function of the mutant proteins. 

To accomplish this, it is necessary to obtain, as described 
and enabled herein, large amounts of substantially pure PSl 
protein or PS2 protein from cultured cell systems which can 
express the protein. Delivery of the protein to the affected 
brain areas or other tissues can then be accomplished using 
15 appropriate packaging or administrating systems including, for 
example, liposome mediated protein delivery to the target cells. 

B. Gene Theranv 

In one series of embodiments, gene therapy is may be 
employed in which noinnal copies of the PSl gene or the PS2 gene 

20 are introduced into patients to code successfully for normal 

protein in one or more different affected cell types. The gene 
must be delivered to those cells in a form in which it can be 
taken up and code for sufficient protein to provide effective 
function. Thus, it is preferred that the recombinant gene be 

25 operably joined to a strong promote so as to provide a high level 
of expression which will compensate for, or out-compete, the 
mutant proteins. As noted above, the recombinant construct may 
contain endogenous or exogenous regulatory elements, inducible or 
repressible regulatory elements, or tissue-specific regulatory 

30 elements. 

In another series of embodiments, gene therapy may be 
employed to replace the mutant gene by homologous recombination 
with a recombinant construct. The recombinant construct may 
contain a normal copy of the targeted presenilin gene, in which 

35 case the defect is corrected in situ , or may contain a -knock- 
out" construct which introduces a stop codon, missense mutation, 
or deletion which abolished function oiE the mutant gene. It 
should be noted in this respect that such a construct may knock- 
out both the normal and mutant copies of the targeted presenilin 

40 gene iii a heterozygous individual, but the total loss of 
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presenilin gene function may be less deleterious to the 
individual than continued progression of the disease state. 

In another series of embodiments, antisense gene therapy may 
be employed. The antisense therapy is based on the fact that 
S sequence-specific suppression of gene expression can be achieved 
by intracellular hybridization between mRNA or DNA and a 
complementary antisense species. The formation of a hybrid 
duplex may then interfere with the transcription of the gene 
and/or the processing, transport, translation and/or stability of 

10 the target presenilin mRNA, Antisense strategies may use a 

variety of approaches including the administration of antisense 
oligonucleotides or antisense oligonucleotide analogs (e,g,, 
analogs with phosphorothioate backbones) or tremsfection with 
antisense RNA expression vectors. Again, such vectors may 

15 include exogenous or endogenous regulatory regions, inducible or 
repressible regulatory elements, or tissue- specific regulatory 
elements . 

In another series of embodiments, gene therapy may be used 
to introduce a recombinant construct encoding a protein or 

20 peptide which blocks or otherwise corrects the aberrant function 
caused by a mutant presenilin gene. In one embodiment, the 
recombinant gene may encode a peptide which corresponds to a 
mutant domain of a presenilin which has been found to abnormally 
interact with another cell protein or other cell ligand. Thus, 

25 for example, if a mutant TM6->7 domain is found to interact with a 
particular cell protein but the corresponding normal TM6-»7 domain 
does not undergo this interaction, gene therapy may be employed 
to provide an excess of the mutant TM6-^7 domain which may compete 
with the mutant protein and inhibit or block the aberrant 

30 interaction. Alternatively, the portion of a protein which 
interacts with a mutant, but not a normal, presenilin may be 
encoded and expressed by a recombinant construct in order to 
compete with, and thereby inhibit or block, the aberrant 
interaction. Finally, in another embodiment, the same effect 

3 5 might be gained by inserting a second mutant protein by gene 
therapy in fUi approach similar to the correction of the "Deg 
1(d)" and "Mec 4(d)" mutations in C. eleaans by insertion of 
mutant treuisgenes . 

Retroviral vectors can be used for somatic cell gene therapy 

40 especially because of their high efficiency of infection and 
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Stable integration and expression. The targeted cells however 
roust be able to divide and the expression of the levels of normal 
protein should be high because the disease is a dominant one. 
The full length PSl or PS2 genes, subsequences encoding 
5 functional domains of the presenilins, or any of the other 
therapeutic peptides described above, can be cloned into a 
retroviral vector and driven from its endogenous promoter, from 
the retroviral long terminal repeat, or from a promoter specific 
for the target cell type of interest (e.g., neurons). Other 
10 viral vectors which can be used include adeno-associated virus, 
vaccinia virus, bovine papilloma virus, or a herpes virus such as 
Epstein-Barr virus. 
C. Immunotherapy 

Immunotherapy is also possible for Alzheimer's Disease. 

15 Antibodies are raised to a mutant PSl or PS2 protein (or a 

portion thereof) and are administered to the patient to bind or 
block the mutant protein and prevent its deleterious effects. 
Simultaneously, expression of the normal protein product could be 
encouraged. Alternatively, antibodies are raised to specific 

20 complexes between mutant or wild- type PSl or PS2 and their 
interaction partners. 

A further approach is to stimulate endogenous antibody 
production to the desired antigen. Administration could be in 
the form of a one time immunogenic preparation or vaccine 

25 immunization. An immunogenic composition may be prepared as 
injectables, as liquid solutions or emulsions. The PSl or PS2 
protein or other antigen may be mixed with pharmaceutically 
acceptable excipients compatible with the protein. Such 
excipients may include water, saline, dextrose, glycerol, ethanol 

30 and combinations thereof. The immunogenic composition and 
vaccine may further contain auxiliary siibstances such as 
emulsifying agents or adjuvants to enhance effectiveness. 
Immunogenic compositions and vaccines may be administered 

parenterally by injection subcutaneously or intramuscularly. 
35 The immunogenic preparations and vaccines are administered 

in such amount as will be thera^peutically effective, prote^^ 
and immunogenic. Dosage depends on the route of administration 
and will vary according to the size of the host. 
D. Small Molecule Therapent-j^p 
40 As described and enabled herein, the present invention 
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provides for a nvunber of methods of identifying small molecules 
or other compounds which may be useful in the treatment of 
Alzheimer's Disease or other disorders caused by mutations in the 
presenilins. Thus, for example, the present invention provides 
for methods of identifying presenilin binding proteins and, in 
particular, methods for identifying proteins or other cell 
components which bind to or otherwise interact with mutant 
presenilins but not with the normal presenilins. The invention 
also provides for methods of identifying small molecules which 
can be used to disrupt aberrant interactions between mutant 
presenilins and such proteins or other cell con^onents. 

Such interactions, involving mutant but not normal 
presenilins, not only provide information useful in understanding 
the biochemical pathways disturbed by mutations in the 
15 presenilins, and causative of Alzheimer's Disease, but also 
provide immediate therapeutic targets for intervention in the 
etiology of the disease. By identifying these proteins and 
analyzing these interactions, it is possible to screen for or 
design compounds which counteract or prevent the interaction, 
20 thus providing possible treatment for abnormal interactions. 

These treatments would alter the interaction of the presenilins 
with these partners, alter the ftinction of the interacting 
protein, alter the amount or tissue distribution or expression of 
the interaction partners, or alter similar properties of the 
25 presenilins themselves. 

Therapies can be designed to modulate these interactions and 
thus to modulate Alzheimer's Disease and the other conditions 
associated with acquired or inherited abnormalities of the PSl or 
PS2 genes or their gene products. The potential efficacy of 
30 these therapies can be tested by analyzing the affinity and 

function of these interactions after exposure to the therapeutic 
agent by standard pharmacokinetic measurements of affinity (Kd 
and Vmax etc.) using synthetic peptides or recombinant proteins 
corresponding to fvinctional domains of the PSl gene, the PS2 gene 
35 or other presenilin homologues. : Another method for assaying the 
effect of any interactions, involving functional domains such as 
the hydrophilic loop is to monitor changes in the intracellular 
trafficking and post-translational modification of the relevant 
genes by in eitU hybridization, immunohistochemistry, Western 
blotting and metabolic pulse-chase labeling studies in the 
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presence of, and in the absence of, the therapeutic agents. A 
further method is to monitor the effects of "downstream" events 
including (i) changes in the intracellular metabolism, 
trafficking and targeting of APP and its products; (ii) changes 
5 in second messenger events, e.g., cAMP intracellular Ca»*, protein 
kinase activities, etc. 

As noted above, the presenilins may be involved in APP 
metabolism and the phosphorylation state of the presenilins may 
be critical to the balance between the a-secretase and Afi 
10 pathways of APP processing. Using the transformed cells and 

animal models of the present invention, one is enabled to better 
understand these pathways and the aberrant events which occur in 
presenilin mutants. Using this knowledge, one may then design 
therapeutic strategies to counteract the deleterious affects of 
15 presenilin muteuits. 

To treat Alzheimer's Disease, for example, the" 
phosphorylation state of PSl and/or can be altered by chemical 
and biochemical agents (e.g. drugs, peptides and other compounds) 
which alter the activity of protein kinase C and other protein 
20 kinases, or which alter the activity of protein phosphatases, or 
which modify the availability of PSl to be post-translationally 
modified. The interactions of kinases and phosphatases with the 
presenilin proteins, and the interactions of the presenilin 
proteins with other proteins involved in the trafficking of APP 
25 within the Golgi network, can be modulated to decrease 

trafficking of Golgi vesicles to the endosome-lysosome pathway, 
thereby inhibiting A^ peptide production. Such compounds will 
include peptide analogues of APP, PSl, PS2, and other presenilin 
homologues, as well as other interacting proteins, lipids, 
3 0 sugars, and agents which promote differential glycosylation of 

PSl, PS2 and/or their homologues; agents which alter the biologic 
half -life of presenilin mRNA or proteins, including antibodies 
and antisense oligonucleotides; and agents which act upon PSl 
auxd/or PS2 transcription. 
35 The effect Of these agents in cell lines and whole animals 

. can be monitored by monitoring transcription, translation,, and . 
post-treuislational modification of PSl and/or PS2 (e.g. 
phosphorylation or glycosylation) , as well as intracellular 
trafficking of PSl and/or PS2 through various intracellular and 
40 extracellular compartments. Methods for these studies include 
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Western and Northern blots, iramunoprecipitation after metabolic 
labelling (pulse-chase) with radio- labelled methionine and ATP, 
and immxinohistochemistry. The effect of these agents can also be 
monitored using studies which examine the relative binding 
5 affinities and relative amounts of PSl and/or PS2 proteins 

involved in interactions with protein kinase C and/or APP, using 
either standard binding affinity assays or co-precipitation and 
Western blots using emtibodies to protein kinase APP, PSl, 
PS2, or other presenilin homologues. The effect of these agents 
10 can also be monitored by assessing the production of AjS peptides 
by ELISA before and after exposure to the putative therapeutic 
agent (see, e.g., Huang et al., 1993). The effect can also be 
monitored by assessing the viability of cell lines after exposure 
to aluminum salts and/or the Aj5 peptides which are thought to be 
15 neurotoxic in Alzheimer's Disease. Finally, the effect of these 
agents can be monitored by assessing the cognitive function of 
animals bearing normal genotypes at APP and/or their presenilin 
homologues, bearing human APP transgenes (with or without 
mutations) , bearing human presenilin transgenes (with or without 
20 mutations), or bearing amy combination of these. 

Similarly, as noted above, the presenilins may be involved 
in the regulation of Ca^* as receptors or ion channels. This role 
of the presenilins also may be explored using the treuisformed 
cell lines and finimal models of the invention. Based upon these 
25 results, a test for Alzheimer's Disease can be produced to detect 
an abnormal receptor or an abnormal ion channel function related 
to abnormalities that are acquired or inherited in the presenilin 
genes and their products, or in one of the homologous genes and 
their products. This test can be accomplished either in vivo or 
30 in vitro by measurements of ion channel fluxes and/or 

transmembrane voltage or current fluxes using patch clamp, 
voltage clamp and fluorescent dyes sensitive to intracellular 
calcium or transmembrane voltage. Defective ion chauinel or 
receptor function can also be assayed by measurements of 
35 activation of second messengers such as cyclic AMP, cGMP tyrosine 
kinases, phosphates, increases in intracellular Ca^* leyels, etq. 
Recombinantly made proteins may also be reconstructed in 
artificial membrane systems to study ion channel conductance. 
Therapies which affect Alzheimer's Disease (due to 
40 acquired/inherited defects in the PSl gene or PS2 gene; due to 
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defects in other pathways leading to this disease such as 
mutations in APP; and due to environmental agents) can be tested 
by analysis of their ability to modify an abnormal ion channel or 
receptor function induced by mutation in a presenilin gene. 
5 Therapies could also be tested by their ability to modify the 
normal function of an ion channel or receptor capacity of the 
presenilin proteins. Such assays can be performed on cultured 
cells expressing endogenous normal or mutant PSl genes/gene 
products or PS2 genes/gene products. Such studies also can be 

10 performed on cells transfected with vectors capable of expressing 
one of the presenilins, or functional domains of one of the 
presenilins, in normal or mutant form. Therapies for Alzheimer's 
Disease can be devised to modify an abnormal ion channel or 
receptor function of the PSl gene or PS2 gene. Such therapies 

15 can be conventional drugs, peptides, sugars, or lipids, as well 
as antibodies or other ligands which affect the properties of the 
PSl or PS2 gene product. Such therapies can also be performed by 
direct replacement of the PSl gene and/or PS2 gene by gene 
therapy. In the case of an ion channel, the gene therapy could 

20 be performed using either mini -genes (cDNA plus a promoter) or 

genomic constructs bearing genomic DNA sequences for parts or all 
of a presenilin gene. Mutant presenilins or homologous gene 
sequences might also be used to counter the effect of the 
inherited or acquired abnormalities of the presenilin genes as 

25 has recently been done for replacement of the Mec 4 and Deg 1 in 
g, el^qans (Huang and Chalfie, 1994) . The therapy might also be 
directed at augmenting the receptor or ion channel fxmction of 
one homologue, such as the PS2 gene, in order that it may 
potentially take over the functions of a mutant form of another 

30 horaologue (e.g., a PSl gene rendered defective by acquired or 

inherited defects) . Therapy using antisense oligonucleotides to 
block the expression of the mutant PSl gene or the mutant PS2 
gene, co-ordinated with gene replacement with normal PSl or PS2 
gene can also be applied^ using standard techniques of either gene 

35 therapy or protein replacement therapy, 

Bxampleg 

BXaWtPlg 1, PgvfilOPment of the genetic, o hvsical "contia" and 

transcriptional man of the minima l co«seareaatinq region , 

The CEPH MegaYAC and the RPCl PAC human total genomic DNA 
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libraries were searched for clones containing genomic DNA 
fragments from the ADS region of chromosome 14q24.3 using 
oligonucleotide probes for each of the 12 SSR marker loci used in 
the genetic linkage studies as well as additional markers 
5 (Albertsen et al,, 1990; Chumakov et al., 1992; loannu et al., 
1994) , The genetic map distances between each marker are 
depicted above the contig, and are derived from published data 
(NIH/CEPH Collaborative Mapping Group, 1992; Wang, 1992; 
Weissenbach et al., 1992; Gyapay et al., 1994). Clones recovered 

10 for each of the initial marker loci were arranged into an ordered 
series of partially overlapping clones ("contig") using four 
independent methods. First, sequences representing the ends of 
the YAC insert were isolated by inverse PGR (Riley et al., 1990), 
and hybridized to Southern blot panels containing restriction 

15 digests of DNA from all of the YAC clones recovered for all of 
the initial loci in order to identify other YAC clones bearing 
overlapping sequences. Second, inter-Alu PCR was performed on 
each YAC, and the resultant band patterns were compared across 
the pool of recovered YAC clones in order to identify other 

20 clones bearing overlapping sequences (Bellamne-Chartelot et al., 
1992; Chumakov et al., 1992). Third, to improve the specificity 
of the Alu-PCR fingerprinting, the YAC DNA was restricted with 
Haelll or Rsal, the restriction products were amplified with both 
Alu and LIH consensus primers, and the products were resolved by 

25 polyacrylamide gel electrophoresis. Finally, as additional STSs 
were generated during the search for transcribed sequences, these 
STSs were also used to identify overlaps. The resultant contig 
was complete except for a single discontinuity between YAC932C7 
bearing D14S53 and YAC746B4 containing D14S61. The physical map 

30 order of the STSs within the contig was largely in accordance 
with the genetic linkage map for this region (NIH/CEPH 
Collaborative Mapping Group, 1992; Wang and Weber, 1992; 
Weissenbach et al,, 1992; Gyapay et al., 1994). However, as with 
the genetic maps, it was not possible to resolve unambiguously 

35 the relative order of the loci within the D14S43/D14S*7l cluster 
and the D14S76/D14S273 cluster. PACl clones suggested that 
D14S277 is telomeric to D14S268, whereas genetic maps have 
suggested the reverse order . Furthermore , a few STS probes 
failed to detect hybridization patterns in at least one YAC clone 

40 whlchL, on the basis of the most parsimonious consensus physical 
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map and from the genetic map, would have been predicted to 
contain that STS, For instance, the D14S268 (AFM265) and RSCAT7 
STSs are absent from YAC788H12. Because these results were 
reproducible, and occurred with several different STS markers, 
5 these results most likely reflect the presence of small 
interstitial deletions within one of the YAC clones. 

Exaffiplg Cumulative two-noint lod s cores for chromosome 

14a24,3 markers. 

Genotypes at each polymorphic microsatellite marker locus 

10 were determined by PGR from lOOng of genomic DNA of all available 
affected and unaffected pedigree members as previously described 
(St, George-Hyslop et al., 1992) using primer sequences specific 
for each microsatellite locus (Weissenbach et al., 1992; Gyapay 
et al,, 1994). The normal population frequency of each allele 

15 was determined using spouses and other neurologically normal 
subjects from the same ethnic groups, but did not differ 
significantly from those established for mixed Caucasian 
populations (Weissenbach et al., 1992; Gyapay et al., 1994). The 
maximum likelihood calculations assumed an age of onset 

20 correction, marker allele frequencies derived from ptiblished 
series of mixed Caucasian subjects, euid an estimated allele 
frequency for .the AD3 mutation of 1:1000 as previously described 
(St. George-Hyslop et al., 1992). The analyses were repeated 
using equal marker allele frequencies, and using phenotype 

25 information only from affected pedigree members as previously 

described to ensure that inaccuracies in the estimated parameters 
used in the msucimum likelihood calculations did not misdirect the 
analyses (St. George-Hyslop et al., 1992). These supplemental 
analyses did not significantly alter either the evidence 

3 0 supporting linkage, or the discovery of recombination events. 

Example ?. Haplotvpes between flankin g markers segregate with 

^3 j.n FAP, 

Extended haplotypes between the centromeric and telomeric 
flanking markers on the parental copy of chromosome 14 

35 segregating with AD3 in fourteen early onset FAD pedigrees 

(pedigrees NIH2, MGHl, Torl.l, FAD4, FADl, MEXl, and FAP2) show 
pedigree specific lod scores > +3.00 with at least one marker 
between D14S258 and D14S53. Identical partial haplotypes are 
observed in two regions of the disease bearing chromosome 

40 segregating in several pedigrees of similar ethnic origin. In 
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region A, shared alleles are seen at D14S268 ("B": allele size = 
126 bp, allele frequency in normal Caucasians - 0.04; "C": size c 
124 bp, frequency = 0.38); D14S277 ("B": size « 156 bp, frequency 
- 0.19; "C": size = 154 bp, frequency = 0.33); and RSCAT6 ("D": 
size - lllbp, frequency 0.25; "E" : size « 109bp, frequency - 
0.20; "F": size = 107 bp, frequency « 0.47). In region B, 
alleles of identical size are observed at D14S43 ("A": size = 
I93bp, frequency = 0.01; "D": size = 187 bp, frequency = 0.12; 
"E": size = 185 bp, frequency « 0.26; "I": size » 160 bp, 
frequency = 0.38); D14S273 ("3": size - 193 bp, frequency - 0.38; 
"4" size « 191 bp, frequency - 0.16; "5": size ■ 189 bp, 
frequency = 0.34; "6": size » 187 bp, frequency » 0.02) and 
D14S76 ("1": size = bp, frequency = 0.01; "5": size « bp, 
frequency - 0.38; "6": size « bp, frequency - 0.07; "9": size » 
15 bp, frequency « 0.38). See Sherrington et al. (1995) for 
details. 

gx^mp^e 4. Recovery of trans cribed seouencea from the Apg 

interval . 

Putative transcribed sequences encoded in the AD3 interval 

were recovered using a direct hybridization method in which short 

cDNA fragments generated from human brain mRNA were hybridized to 

immobilized cloned genomic DNA fragments (Rommens et al., 1993). 

The resultant short putatively transcribed sequences were used as 

probes to recover longer transcripts from human brain cDNA 

25 libraries (Stratagene, La Jolla) . The physical locations of the 

original short clone and of the subsequently acquired longer cDNA 

clones were established by analysis of the hybridization pattern 

generated by hybridizing the probe to Southern blots containing a 

panel of EcoRI digested total DNA samples isolated from 

30 individual YAC clones within the contig. The nucleotide sequence 

of each of the longer cDNA clones was determined by automated 

cycle sequencing (Applied Biosystems Inc., CA) , and compared to 

other sequences in nucleotide and protein databases using the 

blast algorithm (Altschul et al., 1990). Accession numbers for 

35 the transcribed sequences arie: L40391, L40392, L40393, L4d394, 

v^40395, L40396, L40397, L40398;, L4d399, L40400, L40401, L40402, 
and L40403. . 

^^"^^^ t-ggfttim mutations in the PSi g^ne using reatrlg^^on 

enzvmea . 

40 The presence of the A246E mutation, which creates a Ddel 
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restriction site, was assayed in genomic DNA by PCR using an end 
labeled primer corresponding essentially to bp 907-925 of SEQ ID 
NO: 1 and an unlabelled primer corresponding to the complement of 
bp 1010-990 of SEQ ID NO: 1, to amplify an 84bp genomic exon 
5 fragment using lOOng of genomic DNA template, 2mM MgCla, 10 pMoles 
of each primer, 0.5U Tag polymerase, 250 uM dNTPs for 30 cycles 
of 95«»C X 20 seconds, 60«C X 20 seconds, 72*>C X 5 seconds • The 
products were incubated with an excess of Ddel for 2 hours 
according to the manufacturer's protocol, and the resulting 

10 restriction fragments were resolved on a 6% nondenaturing 
polyacrylamide gel and visualized by autoradiography. The 
presence of the mutation was inferred from the cleavage of the 
84bp fragment to due to the presence of a Ddel restriction site. 
All affected members of the FADl pedigree and several at-risk 

15 members carried the Ddel site. None of the obligate escapees 
(those individuals who do not get the disease, age > 70 years) , 
and none of the normal controls carried the Ddel mutation. 

EXftTnplg ^. Locating mutations in the PSl aene using allele 

specific oligonucleotides. 

20 The presence of the C410Y mutation was assayed using allele 

specific oligonucleotides. lOOng of genomic DNA was amplified 
with an exonic sequence primer corresponding to bp 1451-1468 of 
SEQ ID NO: 1 and an opposing intronic sequence primer 
complementary to bp 719-699 of SEQ ID NO: 14 using the above 

25 reaction conditions except 2.5 mM MgCla, and cycle conditions of 
94*C X 20 seconds, 58'C X 20 seconds, and 72'C for 10 seconds). 
The resultant 216bp genomic fragment was denatured by 10 -fold 
dilution in 0.4M NaOH, 25 mM EDTA, and was vacuum slot-blotted to 
duplicate nylon membranes. An end-labeled "wild type" primer 

30 (corresponding to bp 1468-1486 of SEQ ID NO: 1) and an end- 
labeled "mutant" primer (corresponding to the same sequence but 
with a G^A substitution at position 1477) were hybridized to 
separate copies of the slot -blot filters in 5 X SSC, 5 X 
Denhardt's, 0.5% SDS for 1 hour at 48*C, and then washed 

35 successively in 2 X SSC at 23*C and 2 X SSC, 0.1% SDS at SO'C and 
then e35>osed to X-ray film. All testable affected members as 
well as some at-risk members of the AD3 and NIH2 pedigrees 
possessed the C410Y mutation. Attempts to detect the C410Y 
mutation by SSCP revealed that a common intronic sequence 

40 polymorphism migrated with the same SSCP pattern. 
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EyampJ.^ 7. NortheTT | hybridization demonstrating the eacpression 

of PSl protein mR NA in a variety of tissues. 

Total cytoplasmic RNA was isolated from various tissue 
samples (including heart, brain and different regions of 
5 placenta, lung, liver, skeletal muscle, kidney and pancreas) 

obtained from surgical pathology using standard procedures such 
as CsCl purification. The RNA was then electrophoresed on a 
formaldehyde gel to permit size fractionation. The 
nitrocellulose membrane was prepared and the RNA was then 

10 transferred onto the membrane, "P- labeled cDNA probes were 
prepared and added to the membrane in order for hybridization 
between the probe the RNA to occur. After washing, the membrane 
was wrapped in plastic film and placed into imaging cassettes 
containing X-ray film. The autoradiographs were then allowed to 

15 develop for one to several days. Sizing was established by 
comparison to standard RNA markers. Analysis of the 
autoradiographs revealed a prominent band at 3.0 kb in size (see 
Figure 2 of Sherrington et al., 1995). These northern blots 
demonstrated that the PSl gene is expressed in all of the tissues 

20 examined. 

Ex^mp;^ 8- — Eyqcftryptj-c prokarvotic expression vector systems. 

ConstJTucts suitable for use in exikaryotic and prokaryotic 
expression systems have been generated using three different 
classes of PSl nucleotide cDNA sequence inserts. In the first 

25 class, termed full-length constructs, the entire PSl cDNA 

sequence is inserted into the expression plasmid in the correct 
orientation, and includes both the natural 5' UTR and 3' DTR 
sequences as well as the entire open reading frame. The open 
reading frames bear a nucleotide sequence cassette which allows 

30 either the wild type open reading frame to be included in the 
expression system or alternatively, single or a combination of 
double mutations can be inserted into the open reading frame. 
This was accomplished by removing a restriction fragment from the 
wild type open reading frame using the enzymes Narl and Pflml and 

35 replacing it with a similar fragment generated by reverse 

transcriptase PGR and bearing the nucleotide sequence encoding, 
either the M146L mutation or the H163R mutation. A second 
restriction fragment was removed from the wild type normal 
nucleotide sequence for the open reading frame by cleavage with 

40 the enzymes Pflml and Ncol and replaced with a restriction 
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fragment bearing the nucleotide sequence encoding the A246E 
mutation, the A260V mutation, the A285V mutation, the L286V 
mutation, the L392V mutation or the C410Y mutation. A third 
variant, bearing a combination of either the M146L or H163R 
5 mutation in tandem with one of the remaining mutations, was made 
by linking a Narl-Pflml fragment bearing one of the former 
mutations and a Pflml-Ncol fragment bearing one of the latter 
mutations , 

The second class of cDNA inserts, termed trxincated 

10 constructs, was constructed by removing the 5' UTR and part of 
the 3' UTR sequences from full length wild type or mutant cDNA 
sequences. The 5' UTR sequence was replaced with a synthetic 
oligonucleotide containing a Kpnl restriction site (GGTAC/C) and 
a small sequence (GCCACC) to create a Kozak initiation site 

15 around the ATG at the beginning of the PSl ORF (bp 249-267 of SEQ 
ID NO: 1), The 3' UTR was replaced with an oligonucleotide 
corresponding to the complement of bp 2568-2586 of SEQ ID NO: 1 
with an artificial EcoRI site at the 5' end. Mutant variants of 
this construct were then made by inserting the mutant sequences 

20 described above at the Narl-Pflml and Pslml-Ncol sites as 
described above. 

The third class of constructs included sequences derived 
from clone cc44 in which an alternative splice of Exon 4 results 
in the elimination of four residues in the N- terminus (SEQ ID NO: 

25 3) . 

For eukaryotic expression, these various cDNA constructs 
bearing wild type cuid mutant sequences, as described above, were 
cloned into the expression vector pZeoSV in which the SV60 
promoter cassette had been removed by restriction digestion and 

3 0 replaced with the CMV promoter element of pcDNA3 (Invitrogen) . 
For prokaryotic expression, constructs have been made using the 
glutathione S- transferase (GST) fusion vector pGEX-kg. The 
inserts which have been attached to the GST fusion nucleotide 
sequence are the same nucleotide sequences described above 

35 bearing either the normal open reading frame nucleotide sequence, 
or bearing a combination of single and double mutations as 
described above. These GST fusion constructs allow expression of 
the partial or full-length protein in prokaryotic cell systems as 
mutant or wild type GST fusion proteins, thus allowing 

40 purification of the full-length protein followed by removal of 
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the GST fusion product by thrombin digestion. A further cDNA 
construct was made with the GST fusion vector, to allow the 
production of the amino acid sequence corresponding to the 
hydrophilic acidic loop domain between TM6 and TM7 of the full- 
5 length protein, either as a wild type nucleotide sequence or as a 
mutant sequence bearing either the A285V mutation, the L286V 
mutation or the L392V mutation. This was accomplished by 
recovering wild type or mutant sequence from appropriate sources 
of RNA using a 5' oligonucleotide primer corresponding to bp 

10 1044-1061 of SEQ ID NO: 1 with a 5' BamHI restriction site 

(G/GATCC) , and a 3' primer corresponding to the complement of bp 
1476-1458 oh SEQ ID NO: 1 with a 5' EcoRI restriction site 
(G/AATTC) . This allowed cloning of the appropriate mutant or 
wild type nucleotide sequence corresponding to the hydrophilic 

15 acidic loop domain at the BamHI and the EcoRI sites within the 
pGEX-KG vector. 

gyamplQ g. Locating additional mutations in the PSl gene. 

Mutations in the PSl gene can be assayed by a variety of 
strategies (direct nucleotide sequencing, allele specific oligos, 

20 ligation polymerase chain reaction, SSCP, RFLPs) using RT-PCR 
products representing the mature mRNA/cDNA sequence or genomic 
DNA. For the A260V and the A285V mutations, genomic DNA carrying 
the exon can be amplified using the same PGR primers and methods 
as for the L286V mutation. 

25 PGR products were then denatured cuid slot blotted to 

duplicate nylon membranes using the slot blot protocol described 
for the C410y mutation. 

The A260V mutation was scored on these blots by using 
hybridization with end-labeled allele-specif ic oligonucleotides 

30 corresponding to the wild type sequence (bp 1017-1036 of SEQ ID 
NO: 1) or the mutant sequence (bp 1017-1036 of SEQ ID NO: 1 with 
C-^T at bp 1027) by hybridization at 48*C followed by a wash at 
52*C in 3X SSC buffer containing 0.1% SDS. The A285V mutation was 
scored on these slot blots as described above but using instead 

35 the alleie-specif ic oligonucleotides for the wild type sequence 
(bp 1093-1111 of SEQ ID NO: 1) or the? mutant primer (bp 1093-1111 
of SEQ ID NO: 1 with C-#T at bp 1102) at 48*C followed by washing 
at 52"C as above except that the wash solution was 2X SSC. 

The L392V mutation was scored by amplification of the exon 

40 from genomic DNA using primers (5' corresponding to. bp 439-456 of 
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SEQ ID NO: 14 and 3' complementary to 719-699 of SEQ ID NO: 14) 
using standard PGR buffer conditions except that the magnesivun 
concentration was 2mM and cycle conditions were 94*C X lo seconds, 
56*C X 20 seconds, and 12'>C X 10 seconds. The resulting 200 base 
pair genomic fragment was denatured as described for the C4ioy 
mutation and slot-blotted in duplicate to nylon membranes. The 
presence or absence of the mutation was then scored by 
differential hybridization to either a wild type end-labeled 
oligonucleotide (bp 1413-1431 of SEQ ID NO: 1) or with an end- 
labeled mutant primer (bp 1413-1431 of SEQ ID NO: i with C-K3 at 
bp 1422) by hybridization at 45-C and then successive washing in 
2X SSC at 23'C and then at 68*C. 

Pxa^np^g IQ. Antibody nroduction. 

Peptide antigens corresponding to portions of the PSl 
protein were synthesized by solid-phase techniques and purified 
by reverse phase high pressure liquid chromatography. Peptides 
were covalently linked to keyhole limpet hemocyanin (KLH) via 
disulfide linkages that were made possible by the addition of a 
cysteine residue at the peptide C- terminus of the presenilin 
20 fragment. This additional residue does not appear normally in 
the protein sequence and was included only to facilitate linkage 
to the KLH molecule. The specific presenilin sequences to which 
antibodies were raised are as follows: 

Polyclonal antibody # hPSl antigen (SEQ ID NO: 2) 

25 1142 30-44 

519 109-123 

520 304-318 
1143 346-360 

These sequences are contained within specific domains of the 
30 PSl protein. For example, residues 30-44 are within the N- 
terminus, residues 109-123 are within the TMl-»2 loop, and 
residues 304-318 and 346-360 are within the large TM6-»7 loop. 
Each of these domains is exposed to the aqueous media and may be 
involved in binding to other proteins critical for the 
35 development of the disease phenotype. The choice of peptides was 
based on analysis of the protein sequence using the IBI Pustell 
antigenicity prediction algorithm. 

A total of three New Zealand white rabbits were immunized 
with peptide-KLH complexes for each peptide antigen in 
combination with Freund's adjuvant and. were subsequently given 
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booster injections at seven day intervals. Antisera were 
collected for each peptide and pooled and IgG precipitated with 
ammonium sulfate. Antibodies were then affinity purified with 
Sulfo-link agarose (Pierce) coupled with the appropriate peptide. 
5 This final purification is required to remove non-specific 

interactions of other antibodies present in either the pre- or 
post -immune ser\im. 

The specificity of each antibody was confirmed by three 
tests. First, each detected single predominant bands of the 

10 approximate size predicted for presenilin-1 on Western blots of 
brain homogenate. Second, each cross-reacted with recombinant 
fusion proteins bearing the appropriate sequence. Third each 
could be specifically blocked by pre -absorption with recombinant 
PSl or the immunizing peptide. 

15 In addition, two different PSl peptide glutathione S- 

transferase (GST) fusion proteins have been used to generate PSl 
antibodies. The first fusion protein included amino acids 1-81 
{N terminus) of PSl fused to GST. The second fusion protein 
included amino acids 266-410 (the TM6-»7 loop domain) of PSl fused 

20 to GST. Constructs encoding these fusion proteins were generated 
by inserting the appropriate nucleotide sequences into pGEX-2T 
expression plasmid (Amrad) • The resulting constructs included 
sequences encoding GST and a site for thrombin sensitive cleavage 
between GST and the PSl peptide. The expression constructs were 

25 transfected into DH5a E.coli and expression of the fusion 

proteins was induced using IPTG. The bacterial pellets were 
lysed and the soluble GST- fusion proteins were purified by single 
step affinity chromatography on glutathione sepharose beads 
(Boehringer-Mannheim, Montreal) . The GST- fusion proteins were 

30 used to immunize mice to generate monoclonal antibodies using 
standard procedures. Clones obtained from these mice were 
screened with purified presenilin fragments. 

In addition, the GST- fusion proteins were cleaved with 
thrombin to release PSl peptide. The released peptides were 

35 purified by size exclusion HPLC and used to immunize rabbits for 
the generation of polyclonal antisera. 

By similar methods, GST fusion proteins were made using 
constructs including nucleotide sequences for amino acids l to 87 
(N terminus) or 272 to 390 (TM6-»TM7 loop) of preeenilin-2 and 

40 employed to generate monoclonal antibodies to that protein. The 
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PS2-GST fusion proteins were also cleaved with thrombin and the 
released, purified peptides used to immunize rabbits to prepare 
polyclonal antisera. 

Ea^^mplg IX, Identification of mutations in PS2 aent», 

5 RT-PCR products corresponding to the PS2 ORF were generated 

from RNA of lyn^hoblasts or frozen post-mortem brain tissue using 
a first oligonucleotide primer pair with the 5' primer 
corresponding to bp 478-496 of SEQ ID NO: 18, and the 3' primer 
complementary to bp 1366-1348 of SEQ ID NO: 18, for a 888 bp 
10 product, and a second primer pair with the S' primer 

corresponding to bp 1083-1102 of SEQ ID NO: 18, and the 3' primer 
complementary to bp 1909-1892 of SEQ ID NO: 18, for a 826 bp 
product. PGR was performed using 250 mMol dNTPs, 2.5 mM MgCl2, 
10 pMol oligonucleotides in 10 ml cycled for 40 cycles of 94 "C X 
15 20 seconds, 58«C X 20 seconds, 72'C X 45 seconds. The PGR 
products were sequenced by automated cycle sequencing (ABI, 
Poster Gity, CA) and the fluorescent chromatograms were scanned 
for heterozygous nucleotide substitutions by direct inspection 
and by the Factura (ver 1.2.0) and Sequence Navigator (ver 
20 i.o.ibls) software packages (data not shown). 

Detection of the N141I mutation: The A-»T substitution at 
nucleotide 787 creates a Bell restriction site. The exon bearing 
this mutation was amplified from lOO ng of genomic DNA using 
lOpMol each of oligonucleotides corresponding to bp 733-751 of 
25 SEQ ID NO: 18 (end- labeled) and the complement of bp 846-829 of 
SEQ ID NO: 18 (unlabelled) , and PGR reaction conditions similar 
to those described below for the M239V mutation. 2ml of the PGR 
product was restricted with Bell (NEBL, Beverly, MA) in 10 ml 
reaction volume according to the manufacturers' protocol, and the 
30 products were resolved by non- denaturing polyacrylamide gel 

electrophoresis. In subjects with wild type sequences, the 114 
bp PGR product is cleaved into 68 bp and 46 bp fragments. Mutant 
sequences cause the product to be cleaved into 53 bp, 46 bp and 
15 bp. 

35 Detection of the M23 9V mutation : The A-.G sxibst i tut ion at 

^"^^®°5^^® ^080 deletes a Nlall I restriction site, allowing, t^^ 
presence of the M239V mutation to be detected by amplification 
from 100 ng of genomic DNA using lOpMol each of oligonucleotides 
corresponding to bp 1009-1026 of SEQ ID NO: 18 and the complement 

40 of bp 1,118-1101 of SEQ ID NO: 18, PGR conditions were: 0.5 U 
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Taq polymerase, 250 mM dNTPS, ImCi cy"P-dCTP, 1.5 itiM MgClj, 10 ml 
volume; 30 cycles of 94*C X 30 seconds, 58"C X 20 seconds, 72*C X 
20 seconds, to generate a 110 bp product. 2 ml of the PGR 
reaction were diluted to 10 ml and restricted with 3 U of Nlalll 
5 (NEBL, Beverly, MA) for 3 hours. The restriction products were 
resolved by non- denaturing polyacrylamide gel electrophoresis and 
visualized by autoradiography. Normal sxibjects show cleavage 
products of 55, 35, 15 and 6 bp, whereas the mutauit sequence 
gives fragments of 55, 50 and 6 bp. 

10 Detection of the I420T mutation: Similarly to the 

procedures above, the I420T mutation may be screened for by PGR 
amplification of genomic DNA using primers corresponding to bp 
1576-1593 of SEQ ID NO: 18 and the complement of bp 1721-1701 of 
SEQ ID NO: 18 to generate a 146 base pair product. This product 

15 may then be probed with allele specific oligonucleotides for the 
wild-type (e.g., bp 1616-1632 of SEQ ID NO: 18) ajid mutant (e.g., 
bp 1616-1632 of SEQ ID NO: 18 with a T-»C substitution at bp 1624) 
sequences . 

E;^giTnpt^ 12. Transqffljg n^igg, 

20 A series of wild type and mutant PSl and PS2 genes were 

constructed for use in the preparation of transgenic mice. 
Mutant versions of PSl and PS2 were generated by site-directed 
mutagenesis of the cloned cDNAs cc33 (PSl) and cc32 (PS2) using 
standard techniques. 

25 cDNAs cc33 and cc32 and their mutant versions were used to 

prepare two classes of mutant and wild type PSl and PS2 cDNAs, as 
described in Example 8. The first class, referred to as "full- 
length** cDNAs, were prepared by removing approximately 200 bp of 
the 3' untranslated region immediately before the poly A site by 

30 digestion with EcoRI (PSl) or PvuII (PS2) . The second class, 

referred to as "truncated" cDNAs, were prepared by replacing the 
5' untranslated region with a ribosome binding site (Kozak 
consensus sequence) placed immediately 5' of the ATG start codon. 
Various full length and truncated wild type and mutant PSl 

35 and PS2 cDNAs , prepared as described above, were introduced into 
one or more of the following vectors and the resulting constructs 
were used as a source of gene for the production of trauxsgenic 
mice. 

The cos.TET expression vector: This vector was derived from 
40 a cosmid clone containing the Syrian hamster PrP gene. It has 
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been described in detail by Scott et al. (1992) and Hsiao et al. 
(1995) . PSl and PS2 cDNAs (full length or truncated) were 
inserted into this vector at its Sail site. The final constructs 
contain 20 )cb of 5' sequence flanking the inserted cDNA. This 5' 
5 flanking sequence includes the PrP gene promoter, 50 bp of a PrP 
gene 5' untranslated region exon, a splice donor site, a i kb 
intron. and a splice acceptor site located innnediately adjacent 
to the Sail site into which the PSl or PS2 cDNA was inserted. 
The 3' sequence flanking the inserted cDNA includes an 
10 approximately 8 kb segment of PrP 3' xintranslated region 

including a polyadenylation signal. Digestion of this construct 
with NotI (PSl) or Fsel (PS2) released a fragment containing a 
mutant or wild type PS gene under the control of the PrP 
promoter. The released fragment was gel purified euid injected 
15 into the pronuclei of fertilized mouse eggs using the method of 
Hsiao et al. (1995) . 

Platelet -derived orowr-h f a ctor rgceptor fl-RiihiiTTj ^ 
£2IlS£Bi££a: PS cDNAs were also introduced between the Sail (full 
length PSl cDNAs) or Hindlll (truncated PSl cDNAs, full length 

20 PS2 CDNAs, and truncated PS2 cDNAs) at the 3' end of the human 
platelet derived growth factor receptor /3-subunit promoter and 
the EcoRI site at the 5' end of the SV40 poly A sequence and the 
entire cassette was cloned into the pZeoSV vector (Invitrogen, 
San Diego, CA.). Fragments released by Scal/BamHI digestion were 

25 gel purified and injected into the pronuclei of fertilized mouse 
eggs using the method of Hsiao et al. (1995). 

HVff"ftr^ g-ftgtiP ggnBtrtifftrff?: PSI and PS2 CDNAs were inserted 
into the Sail site of pBAcGH. The construct produced by this 
insertion includes 3.4 kb of the human 0 actin 5' flanking 

30 sequence (the human /J actin promoter, a spliced 78 bp human 

actin 5' untranslated exon and intron) and the PSl or PS2 insert 
followed by 2.2 kb of human growth hormone genomic sequence 
containing several introns and exons as well as a polyadenylation 
signal. Sfil was used to release a PS-containing fragment which 

35 was gel purified and injected into the pronuclei of fertilized 
mouse eggs using the method of Hsiao et al- (1995) . 

Phosphoqlvcerate kinase conatTi^.;i^^p . psi and PS2 cDNAs were 
introduced into the pkJ90 vector. The cDNAs were inserted 
between the Kpnl site downstream of the human phosphoglycerate 

40 kinase prompter and the Xbai site upstream of the 3' tintranslated 



CI lOCTTTI ITC CUee^ /niitr*«<i>t 



wo 96/34099 W W FCT/CA96/002d3 



113 - 



region of the human phOBphoglycerate kinase gene, PvuII/Hindlll 
(PSl cDNAs) or Pvull (PS2 cDNAs) digestion was used to release a 
PS- containing fragment which was then gel purified and injected 
into the pronuclei of fertilized mouse eggs as described above. 
5 Example 13 . Expression of recombinant PSl and PS2 in eukarvotic 

Recombinant PSl and PS2 have been expressed in a variety of 
cell types (e.g. PC12, neuroblastoma, Chinese hamster ovary, and 
human embryonic kidney 293 cells) using the pcDNA3 vector 

10 (Invitrogen, San Diego, CA.). The PSl and PS2 cDNAs inserted 
into this vector were the same full length and truncated cDNAs 
described in Example 8. 

These cDNAs were inserted between the CMV promoter and the 
bovine growth hormone polyadenylation site of pcDNA3 . The 

15 transgenes were expressed at high levels. 

In addition, PSl and PS2 have been expressed in COS cells 
using the pCMX vector. To facilitate tagging and tracing of the 
intracellular localization of the presenilin proteins, 
oligonucleotides encoding a sequence of 11 amino acids derived 

20 from the human c-myc antigen (see, e.g., Evan et al., 1985) and 
recognized by the monoclonal anti-myc antibody MYC 1-9E10.2 
(Product CRL 1729, ATCC, Rockville, Md.) were ligated in-frame 
either immediately in front of or immediately behind the open 
reading frame of PSl and PS2 cDNAs. Untagged pCMX constructs 

25 were also prepared. The c-royc-tagged constructs were also 
introduced into pcDNA3 for tramsf ection into CHO cells. 

Transient and stable transfection of these constructs has 
been achieved using Lipofectamine (Gibco/BRL) according to the 
manufacturer's protocols. Cultures were assayed for transient 

3 0 expression after 48 hours. Stably transf acted lines were 
selected using 0.5 mg/ml Geneticin (Gibco/BRL). 

Expression of transfected PS proteins was assayed by Western 
blot using the ant i -presenilin antibodies 1142, 519 and 520 
described above. Briefly, cultured transfected cells were 

35 solubilized (2% SDS, 5 mM EDTA, I mg/ml leupep tin and aprotinin) , 
and the protein concentration was determined by Lowry. Proteins 
were separated on SDS -PAGE gradient gels (4-20% Novex) and 
transferred to PVDF (10 mM CAPS) for 2 hr at a constant voltage 
(50V) . Non-specific binding was blocked with skim milk (5%) for 

40 1 hr. The proteins were then probed with the two rabbit 
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polyclonal antibodies (-Img/ml in TBS, pH 7.4) for 12 hrs, 
Presenilin cross -reactive species were identified using 
biotinylated goat-anti rabbit secondary antibody which was 
visualized using horseradish peroxidase -conjugated strepavadin 
5 tertiary, 4-chloro-napthol, and hydrogen peroxide. The c-myc- 

tagged presenilin peptides were assayed by Western blotting using 
both the anti -presenilin antibodies described above (to detect 
the presenilin peptide antigen) , and culture supernatant from the 
hybridoma MYC 1-9E10.2 diluted 1:10 for Western blots and 1:3 for 

10 inimunocytocheinistry (to detect the myc-epitope) . A major band of 
immunoreactivity of 50-60 kDa was identified by each of the 
various presenilin antibodies, and by the myc-epitope antibodies 
(for cell lines transfected with myc- containing plasmids) . Minor 
bands at -10-19 kDa and at -TOkDa were detected by some 

15 presenilin antibodies. 

For imraunocytochemistry, transfected cells were fixed with 
4% formaldehyde in Tris buffered saline (TBS), washed extensively 
with TBS plus 0.1% Triton and non-specific binding blocked with 
3% BSA. Fixed cells were probed with the presenilin antibodies 

20 (e.g., antibodies 520 and 1142, above; typically 5-10 mg/ml) , 
washed and visualized with FITC- or rhodamine- conjugated goat- 
anti rabbit secondary antibody. For c-myc-tagged presenilin 
constructs, the hybridoma MYC 1-9E10.2 supernatant diluted 1:3 
was used with anti -mouse secondary antibody. Slides were mounted 

25 in 90% glycerol with 0.1% phenylenediamine (ICN) to preserve 

fluorescence. Anti-BIP (or anti-calnexin) (StressGen, Victoria, 
B.C.) and wheat germ agglutinin (EY Labs, San Mateo, CA) were 
used as markers of endoplasmic reticulum and Golgi respectively. 
Doxible-immxino- labeling was also performed with anti -act in (Sigma, 

30 St. Louis, Mo,), anti -amyloid precursor protein (22C11, 

Boehringer Mannheim) and anti -neurofilament (NF-M specific, 
Sigma) in neuronal line NSC34. These immunofluorescence studies 
demonstrated that the transfection product is widely distributed 
within the cell, with a particularly intense perinuclear 

35 localization suggestive of the endoplasmic reticulum and the 
Golgi apparatus, which is similar to that observed in 
uiitransfected cells but is more intense, sometimes spilling over 
into the nuclear membrane. Co-immunolocalization of the c-myc 
and PS epitopes was observed in CHO and COS cells transiently 

40 transfected with the myc-tagged presenilin constructs. 
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Robust expression of the transfected presenilin gene in the 
transfected cells was thus proven by immxinocytochemistry. 
Northern blot. Western blots (using antibodies to presenilins as 
above, and using the monoclonal antibody MYC 1-9E10,2 to the myc 
5 tag in constructs with 3' or 5' c-myc tags). 

Exatnpj-e 14. Isolation of prese nilin binding proteins bv affinity 

chromatography. 

To identify the proteins which may be involved in the 
biochemical function of the presenilins, PSl-binding proteins 
10 were isolated using affinity chromatography. A GST- fusion 

protein containing the PSl TM6-*7 loop, prepared as described in 
Example 8, was used to probe human brain extracts, prepared by 
homogenizing brain tissue by Polytron in physiological salt 
solution. Non-specific binding was eliminated by pre-clearing 
15 the brain homogenates of endogenous GST- binding components by 
incubation with glutathione -Sepharose beads. These GST- free 
homogenates were then incubated with the GST- PS fusion proteins 
to produce the desired complexes with functional binding 
proteins. These complexes were then recovered using the affinity 
20 glutathione- Sepharose beads. After extensive washing with 

phosphate buffered saline, the isolated collection of proteins 
was separated by SDS-polyacrylamide gel electrophoresis (SDS- 
PAGE; Tris-tricine gradient gel 4-20%) . Two major bands were 
observed at -14 and 20 kD in addition to several weaker bands 
25 ranging from 50 to 60 kD. 

Pharmacologic modification of interaction between these 
proteins and the TM6-*7 loop may be employed in the treatment of 
Alzheimer's Disease. In addition, these proteins which are 
likely to act within the presenilin biochemical pathway may be 
30 novel sites of mutations that cause Alzheimer's Disease. 

B?S^mp^g IgQl^tion of presenilin bindi ng proteins bv two- 

hvbrid veast svstem. 

To identify proteins interacting with the presenilin 
proteins, a yeast expression plasmid vector (pAS2-l, Clontech) 
35 was generated by ligating an in-frame partial cDNA sequence 

encoding either residues 266-409 of the PSl protein or residues 
272-390 of the PS2 protein into the EcoRI and BamHI sites of the 
vector. The resultant fusion protein contains the GAL4 DNA 
binding domain coupled in- frame either to the TM6-»7 loop of the 
40 PSl protein or to the TM6-*7 loop of the PS2 protein. These 
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expression plasmids were co- transformed, along with purified 
plasmid DNA from the hiiman brain cDNA:pACT library, into yeast 
using the protocols of the Clontech Matchmaker yeast -two-hybrid 
kit (Clontech) . Yeast clones bearing human brain cDNAs which 
5 interact with the TM6->7 loop domain were selected by HIS 
resistance and ^gal+ activation. The clones were further 
selected by cyclohexamide sensitivity and the inserts of the 
human brain cDNAs were isolated by PGR and sequenced. Of 6 
million initial transformants, 200 positive clones were obtained 
10 after HIS selection, and 42 after ^gal+ color selection, carried 
out in accordance with the manufacturer's protocol for selection 
of positive colonies. Of these 42 clones there were several (5- 
8) independent clones representing the same genes. This 
indicates that these interactions are biologically real and 

15 reproducible. 

Example 16. Transg enic C. eleaans. 

Transgenic C. eleaans were obtained by microinjection of 
oocytes. The vectors pPD49,3 hsp 16-41 and pPD49.78 hsp 16-2 
were chosen for this purpose. Using the first of these vectors, 

20 transgenic gleg^^g were produced in which a normal hPSl gene 
or a mutant (L392V) was introduced. Transformed animals were 
detected by assaying expression of human cDNA on northern blots 
or western blots using human cDNA probe cc32 and antibodies 519, 
520 and 1142, described above. Vectors were also prepared and/or 

25 injected bearing a cis double mutant hPSl gene (M146L and L392V) , 
a normal hPS2 gene, and a mutant (N141I) hPS2 gene. 
P^^amplg 17. Cloning of a Prosophila presenilln hnmolocme. DmPfi . 

Redundant oligonucleotides 5' ctn ccn gar tgg acn gyc tgg 
(SEQ ID NO: 22) and 5' rca ngc {agt)at ngt ngt rtt cca (SEQ ID 

30 NO: 23) were designed from piiblished nucleotide sequence data for 
highly conserved regions of the presenilin/sel.l2 proteins 
ending/beginning with Trp (e.g., at residues Trp247 and Trp404 in 
PSl; Trp253 and Trp385 in PS2) . These primers were used for RT- 
PGR {50ml volume, 2mM MgClj, 30 cycles of 94'C x 30", 57'C x 20"; 

35 72*C X 20") from mRNA from adult and embryonic D. melanogaater . 
The products were then reamplified using cycle conditions of 94'C 
X 1', S9'c X 015' and 72*C x 1' and internal conserved redundant 
primer 5' ttt ttt etc gag acn gen car gar aga aay ga (SEQ ID NO: 
24) and 5' ttt ttt gga tec tar aa(agt) atr aar ten cc (SEQ ID NO: 

40 25) . The -600 bp product was cloned into the BamHI and Xhol 
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Sites of pBS. These products were sequenced and shown to contain 
an open reading frame with a putative amino acid sequence highly 
homologous to that of the human presenilins. This fragment was 
then used to screen a conventional D. melanoaaster cDNA/Zap 
5 library (Stratagene, CA) to recover six independent cDNA clones 
of size - 2-2.5 kb (clones pdsS, pdsl3, pdsl, pds3, pds7 and 
pdsl4) which were sequenced. The longest ORF encodes a 
polypeptide of 541 amino acids with 52% identity to the human 
presenilins . 

10 Ey^M?^? 19, — Assays for long isoforms of t: he Afl peptldgg. 

AS peptides were extracted with 99% formic acid for 60 
minutes (20 'C) from frozen cerebral cortex of histopathologically 
confirmed cases of FAD with PSl or fiAPP„, mutations; sporadic AD 
with no Jcnown family history of the disease; other adult onset 
15 neurodegenerative disorders (HD «= Huntington Disease; 

ALS = amyotrophic lateral sclerosis); Down's Syndrome (DS) ; and 
control subjects without neurologic symptoms. After 
centrifugation at 200,000 X g for 20 minutes, the supernatant was 
separated from the pellet, diluted, neutralized and examined by 
2 0 ELISA. To quantitate different species of AS, four monoclonal 
antibodies were used. Antibody BNT-77 (which detects epitopes 
from the center of Afi) and antibody BAN- 50 (which detects 
N-terminal residues) were used first to bind all types of AS 
including heterologous forms with or without N- terminal 
25 truncation (BNT-77) or only without N-terminal truncation 
(BAN-50) . Two additional monoclonal antibodies, which 
specifically detect either short-tailed AS ending at residue 40 
(antibody BA-27) or long-tailed AS ending at residues 42/43 
(antibody BC-05) , were then used to distinguish the different 
30 C- terminal forms of AS. Two site ELISA was carried out as 

described previously (Tamaoka et al., 1994; Suzuki et al., 1994). 
Briefly, 100 ng of standard peptides or the supematants from 
brain tissue were applied onto microplates coated with the BNT-77 
antibody, incubated at 4»C for 24 hours, washed with phosphate- 
35 buffered saline, and then incubated with HRP- labeled BA-27 and 
BC-05 antibodies at 4«C for 24 hours. HRP activities were 
assayed by color development using the TNB microwell peroxidase 
system as previously described. Cortical AS levels were compared 
between diagnostic groups using paired Student-t tests. Joint 
40 evaluation of all the AS isoform data, using the S tudent- Newman - 



CI iB^rm nr cuec-r /»• n c net 



wo 96/34099 



FCT/CA96/00263 



- 118 - 

Keuls multiple comparison of means test, revealed that Afil-42 
levels from fiAPP^j, and sporadic AD subjects were distinct from 
those for PSi mutation cases, but similar to controls. In 
contrast, three group were distinguishable when A£x-42 levels 
5 were considered: high (PSI and fiAPP,i, AD) , medium (sporadic AD) 
and low (control) . 

Specifically, measurement of the concentrations of the 
various A0 isoforms in the cerebral cortex of 14 control 
subjects, including five subjects with other neurodegenerative 

10 diseases with onset in the fourth and fifth decades of life, 

revealed only low concentrations of both short- tailed A^ (A^l-40: 
0.06 ± 0.02 nMol/gram wet tissue ± SEM; A^x-40: 0.17 ± 0.40) and 
long-tailed hP (Aj3l-42/43: 0.35 ± 0.17; A/Sx-42/43: 1.17 ± 
0.80) . In contrast, the long- tailed A^ peptides were 

15 significantly elevated in the cerebral cortex of all four 

subjects with PSI mutations (A^l-42/43: 6.54 ± 2.0, p = 0.05; 
A5X-42/43: 23.91 ± 4.00, p < 0.01). Similar increases in the 
concentration of long-tailed A/3 peptides were detected in the 
cortex of both subjects with ^APP^^, mutations (Ai8l-42/43: 2.03 ± 

20 1.04; A^x-42/43: 25.15 ± 5.74), and subjects with sporadic AD 

(APl-42/43: 1.21 ± 0.40, p « 0.008; Ai8x-42/43: 14.45 ± 2.81, p 
= 0.001). In subjects with PSI or /SAPP^^, mutations, this 
increase in long- tailed isoforms of A/S was accompanied by a small 
but non- significant increase in short-tailed A/S isoforms (e.g., 

25 A^x-40: 3.08 ± 1.31 in PSI mutants; 1.56 ± 0.07 in jSAPP^? 

mutants) , Thus, the ratio of long to short isoforms was also 
significantly increased. However, in the sporadic AD cases, the 
observed increase in long-tailed A/8 was accon5>anied typically by 
a much larger increase in short-tailed Ajff isoforms (AiSl-40: 3.92 

30 ± 1,42; A^x-40: 16,60 ± 5.88). This increase in short-tailed A0 
was statistically significant when compared to controls (p < 0.03 
for both Aj8l-40 and A^x-40), but was of borderline statistical 
significance when compared to the PSI and /SAPP^^, cases (p _ 
0.05) . Analysis of cortical samples from an adult subject with 

35 Down's syndrome revealed a pattern similar to that observed in 
sporadic AD. 

Although preferred embodiments of the invention have been 
described herein in detail, it will be understood by those 
skilled in the art that variations may be made thereto without 
40 departing from the spirit of the invention or the scope of the 
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appended claims. 

TABLE 1 





POSITION 


ELEMENT 




POSITION 


STATl (OAS) 


34-46 
278-286 


611-619 
631-639 


CAT box 






895-900 
975-982 




431-429 


1582-1590 


TATA box 






925-933 




443-451 


1965-1973 








978-988 




495-503 


2125-2133 


TFIID 






578-581 




533-541 










982-985 


STAT3 


36-43 
124-131 


737-744 
811-898 


TRXN (CAP) 
stmrt 






1002-1007 
1038-1043 




429-436 


1063-1070 


GC box 
(SPl) 






1453-1460 




496-503 


1686-1693 








1454-1452 




533-540 


1966-1973 


AP2, AP2-like 


numeroui 


occurrences 




537-544 


2104-2111 






throughout aequence 




632-639 


2407-2414 


NFIL6 


611 


-620 


1567-1576 


MED1,M£D1-Uke 


1121-1126 


1235-1240 




890 


-899 


1945-1954 




1126-1131 


1716-1721 




1062- 


1071 
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PSl Domain 


ADDroximate Position 


N- terminus 


1-81 


TMl 


82-100 


TMl-*2 


101-132 


TM2 


133-154 


TM2-»3 


155-163 


TM3 


164-183 


TM3-^ 


184-194 


TM4 


195-212 


TM4-»5 


213-220 


TM5 


221-238 


TM5-»6 


239-243 


TM6 


244-262 


TM6-.7 


263-407 


TM7 


408-428 


C- terminus 


429-467 
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15 



PS 2 Dmna 1 n 


ADProx^imate Position 


N»^**Tmi miio 


1-87 


TMl 


Q o "1 
oo * lUo 


TMl-*2 


XU / - i J4 


TM2 




TM2-*3 


lb X -±oy 




17U-189 




190-200 


TM4 


201-218 


TM4-*5 


^19-224 




225-244 


TM5-»6 


245-249 


TM6 


250-268 


TM6-*7 


269-387 


TM7 


388-409 


C- terminus 


410-448 
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Position i.n 


Nucleotide 


Amino Acid 


Functional 


Age of 




SEO ID NO • 1 




Change 


Domain 


Onset of 












FAD 


1 . 


NA 


NA 




«v uex iiimuB 


64 


2. 


492 


G-*C 


V82L 




55 


3. 


NA 


NA 


V 7 o c 


ini 


NA 


45 


591 


T-*C 






37 


5. 


664 


T-*C 


Mil Of 




49 


6. 


NA 


NA 


i JX W J V 




40 


7. 


676 






TM2 


35 


8. 


684 


A-#C 






45 


SO 


NA 


NA 


Ml 46V 




38 


10. 


736 


A-»G 




TM2-*3 


50 


11. 


NA 


NA 


W1 CIV 


TM2-*3 


47 


12. 


NA 


NA 


T^l 71 P 
ux / Xir 


TM3 


35 


13. 


NA 


NA 


G20QV 

U 7 V 


TM4 


NA 


3]£. 


NA 


NA 


I211T 

X X X X 


1 P14 


NA 


15. 


939 


G-»A 


A231T 




52 


16. 


985 


C-*A 




mo 


55 


17. 


1027 


C-*T 


A2fiOV 




40 


18. 


NA 


NA 


V* A V ^ XV 


IfQO^ / 


47 


ZSD. 


1039 


C-#T 


w % JU 




45 


20. 


NA 


NA 


* *» W / o 




35 


21. 


NA 


NA 


E280A 


TM6-»7 


47 


22. 


NA 


NA 


E2e0G 


TM6->7 


42 


23. 


1102 


C->T 




TMo-»7 


50 


22^ 


1104 


C->G 


L286V 


TM6-»7 


50 


25. 


NA 


deletion 


A291-319 


TM6-»7 


NA 


26. 


1399 


G^C 


G384A 


TM6-»7 


35 


27. 


1422 


C-*G 


L392V 


TM6-*7 


25-40 


28. 


1477 


G-*A 


C410Y 


TM7 


48 
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TABLE 5 



Position in Nucleotide Amino Acid Functional Age of 

SEQ ID MO: IB Change Change Domain Onset of fad 

1- 787 A-»T N141I TM2 50-65 

2. 1080 A-*G M239V TM5 50-70 

3- 1624 T-*C 1420T C-terminus 45 



28-61 
65-71 
109-112 
120-122 
218-221 
241-243 
267-269 



25-45 
50-63 
70-75 
114-120 
127-132 
162-167 
221-226 



TABLE 6 

302-310 
311-325 
332-342 
346-359 
372-382 
400-410 



TABLE 7 

282-290 
310-314 
321-338 
345-352 
380-390 
430-435 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: HSC RESEARCH AND DEVELOPMENT LIMITED 

PARTNERSHIP 

(B) STREET: 555 University Avenue 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) POSTAL CODE <ZIP) : MSG 1X8 

(G) TELEPHONE: (416) 813-5982 

(H) TELEFAX: (416) 813-5085 

(A) NAME: THE GOVERNING COUNCIL OF THE UNIVERSITY OF 

TORONTO 

(B) STREET: 106, Simcoe Hall, 27 King's College Circle 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) POSTAL CODE (ZIP) : M5S lAl 

(G) TELEPHONE: (416) 978-7461 

(H) TELEFAX: (416) 978-1878 

(A) NAME: ST. GEORGE-HYSLOP, Peter H. 

(B) STREET: 210 Richview Avenue 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) POSTAL CODE (ZIP) : MSP 3G3 

(A) NAME: FRASER, Paul E. 

(B) STREET: 611 Windermere Avenue 

(C) CITY: Toronto 

(D) STATE: Ontario 
(£) COUNTRY: Canada 

(F) POSTAL CODE (ZIP) : M6S 3L9 

(A) NAME: ROMMENS, Johanna M. 

(B) STREET: 105 McCaul Street 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) POSTAL CODE (ZIP) : M5T 2XT 

(ii) TITLE OF INVENTION: GENETIC SEQUENCES AND PROTEINS 

RELATED TO ALZHEIMER'S DISEASE, 
AND USES THEREFOR 

(iii) NUMBER OF SEQUENCES: 25 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sim & McBumey 

(B) STREET: 330 University Avenue, 6th Floor 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) ZIP: MSG 1R7 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE^ Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1,30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/CA96/00263 

(B) FILING DATE: April. 29, 1996 

(C) CLASSIFICATION: 



SUBSrmJTF^UCCT rni 11 c -jet 



wo 96/34099 



FCr/CA96/00263 



- 128 - 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/509,359 

(B) FILING DATE: 31-JUL-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/496,841 

(B) FILING DATE: 28-JUN-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/431,048 

(B) FILING DATE: 28-APR-1995 

(viii) ATTORNEY /AGENT INFORMATION: 
(A) NAME: RAE, Patricia A. 

(C) REFERENCE/DOCKET NUMBER: 7425-16 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (416) 595-1155 

(B) TELEFAX: (416) 595-1163 



(2) INFORMATION FOR SEQ ID N0:1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE; 

(A) NAME/KEY: CDS 

(B) LOCATION: 249.. 1649 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1. .2675 

(D) OTHER INFORMATION: /note« "hPSl-1" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

TG<3GACAGGC AGCTCCGGGG TCCGCGGTTT CACATCGGAA ACAAAACAGC GGCTGGTCTG 60 

GAAGGAACCT GAGCTACGAG CCGCGGCGGC AGCGGGGCGG CGGGGAAGCG TATACCTTAAT 120 

CTGGGAGCCT GCAAGTGACA ACAGCCTTTG CGGTCCTTAG ACAGCTTGGC CTGGAGGAGA 180 

ACACATGAAA GAAAGAACCT CAAGAGGCTT TGTTTTCTGT GAAACAGTAT TTCTATACAG 240 

TTGCTCCA ATG ACA GAG TTA CCT GCA CCG TTG TCC TAC TTC CAG AAT GCA 290 
Met Thr Glu Leu Pro Ala Pro Leu Ser Tyr Phe Gin Asn Ala 
15 10 

CAG ATG TCT GAG GAC AAC CAC CTG AGC AAT ACT GTA CGT AGC CAG AAT 338 
Gin Met Ser Glu Asp Asn His Leu Ser Asn Thr Val Arg Ser Gin Asn 
15 20 25 30 

GAC AAT AGA GAA CGG CAG GAG CAC AAC GAC AGA CGG AGC CTT GGC CAC 386 
Asp Asn Arg Glu Arg Gin Glu His Asn Asp Arg Arg Ser Leu Gly His 
35 40 45 

CCT GAG CCA TTA TCT AAT GGA CGA CCC CAG GGT AAC TCC CGG CAG GTG 434 
Pro Glu Pro Leu Ser Asn Gly Arg Pro Gin Gly Asn Ser Arg Gin Val 
50 55 60 

GTG GAG CAA GAT GAG GAA GAA GAT GAG GAG CTG ACA TTG AAA TAT GGC 482 
Val Glu Gin Asp Glu Glu Glu Asp Glu Glu Leu Thr Leu Lya Tyr Gly 

^ 65 ■ ■ ■ ■ 70- - 75 

GCC AAG CAT GTG ATC ATG CTC TTT GTC CCT GTG ACT CTC TGC ATG GTG 530 
Ala Lys His Val He Met Leu Phe Val Pro Val Thr Leu Cys Met Val 
80 85 90 
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GTG GTC GTG GCT ACC ATT AAG TCA GTC AGC TTT TAT ACC CGG AAG GAT 578 
Val Val Val Ala Thr He Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp 
95 100 105 110 

GGG CAG CTA ATC TAT ACC CCA TTC ACA GAA GAT ACC GAG ACT GTG GGC 626 
Gly Gin Leu He Tyr Thr Pro Phe Thr Glu Asp Thr Glu Thr Val Gly 
115 120 125 

CAG AGA GCC CTG CAC TCA ATT CTG AAT GCT GCC ATC ATG ATC AGT GTC 674 
Gin Arg Ala Leu His Ser He Leu Asn Ala Ala He Met He Ser Val 
130 135 

ATT GTT GTC ATG ACT ATC CTC CTG GTG GTT CTG TAT AAA TAC AGG TGC 722 
He Val Val Met Thr He Leu Leu Val Val Leu Tyr Lys Tyr Arg Cys 
145 150 155 

TAT AAG GTC ATC CAT GCC TGG CTT ATT ATA TCA TCT CTA TTG TTG CTG 770 
Tyr Lys Val He His Ala Trp Leu He He Ser Ser Leu Leu Leu Leu 
IfiO 165 170 

TTC TTT TTT TCA TTC ATT TAC TTG GGG GAA GTG TTT AAA ACC TAT AAC 818 
Phe Phe Phe Ser Phe He Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn 
175 180 185 190 

GTT GCT GTG GAC TAC ATT ACT GTT GCA CTC CTG ATC TGG AAT TTT GGT 866 
Val Ala Val Asp Tyr He Thr Val Ala Leu Leu He Trp Asn Phe Gly 
155 200 205 

GTG GTG GGA ATG ATT TCC ATT CAC TGG AAA GGT CCA CTT CGA CTC CAG 914 
Val Val Gly Met He Ser He His Trp Lys Gly Pro Leu Arg Leu Gin 
210 215 220 

CAG GCA TAT CTC ATT ATG ATT AGT GCC CTC ATG GCC CTG GTG TTT ATC 962 
Gin Ala Tyr Leu He Met He Ser Ala Leu Met Ala Leu Val Phe He 
225 230 235 

AAG TAC CTC CCT GAA TGG ACT GCG TGG CTC ATC TTG GCT GTG ATT TCA 
Lys Tyr Leu Pro Glu Trp Thr Ala Trp Leu He Leu Ala Val He Ser 
240 245 250 

GTA TAT GAT TTA GTG GCT GTT TTG TOT CCG AAA GGT CCA CTT CGT ATG 1058 
Val Tyr Asp Leu Val Ala Val Leu Cys Pro Lys Gly Pro Leu Arg Met 
255 260 265 270 

CTG GTT GAA ACA GCT CAG GAG AGA AAT GAA ACG CTT TTT CCA GCT CTC 1106 
Leu Val Glu Thr Ala Gin Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu 
275 280 285 

ATT TAC TCC TCA ACA ATG GTG TGG TTG GTG AAT ATG GCA GAA GGA GAC 1154 
He Tyr Ser Ser Thr Met Val Trp Leu Val Asn Met Ala Glu Gly Asp 
290 295 300 

CCG GAA GCT CAA AGG AGA GTA TCC AAA AAT TCC AAG TAT AAT GCA GAA 1202 
Pro Glu Ala Gin Arg Arg Val Ser Lys Asn Ser Lys Tyr Asn Ala Glu 
305 310 315 

AGC ACA GAA AGG GAG TCA CAA GAC ACT GTT GCA GAG AAT GAT GAT GGC 1250 
Ser Thr Glu Arg Glu Ser Gin Asp Thr Val Ala Glu Asn Asp Asp Gly 
320 325 330 

GGG TTC AGT GAG GAA TGG GAA GCC CAG AGG GAC AGT CAT CTA GGG CCT 
Gly Phe Ser Glu Glu Trp Glu Ala Gin Arg Asp Ser His Leu Gly Pro 
335 340 345 350 

CAT CGC TCT ACA CCT GAG TCA CGA GCT GCT GTC CAG GAA CTT TCC AGC 
His Arg Ser Tbr Pro Glu Ser Arg Ala Ala Val Gin Glu Leu Ser Ser 
355 360 365 

rTf f!5 S??: CCA gag GAA AGG GGA GTA AAA err GGA 1394 

Lys 
380 



1010 



1298 



1346 



^Zl T1 T WW UAA AGG GGA GTA AAA CTT GGA 

Ser He Leu Ala Gly Glu Asp Pro Glu Glu Arg Gly Val Lys Leu Gly 
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TTG GGA GAT TTC ATT TTC TAC AGT GTT CTG GTT GGT AAA GCC TCA GCA 1442 
Leu Gly Asp Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ala 
385 390 395 

ACA GCC AGT GGA GAC TGG AAC ACA ACC ATA GCC TGT TTC GTA GCC ATA 1490 
Thr Ala Ser Gly Asp Trp Asn Thr Thr He Ala Cys Phe Val Ala He 
400 405 410 

TTA ATT GGT TTG TGC CTT ACA TTA TTA CTC CTT GCC ATT TTC AAG AAA 1538 
Leu lie Gly Leu Cya Leu Thr Leu Leu Leu Leu Ala He Phe Lye Lvs 
415 420 425 430 

GCA TTG CCA GCT CTT CCA ATC TCC ATC ACC TTT GGG CTT GTT TTC TAC 1586 
Ala Leu Pro Ala Leu Pro He Ser He Thr Phe Gly Leu Val Phe Tyr 
435 440 445 

TTT GCC ACA GAT TAT CTT GTA CAG CCT TTT ATG GAC CAA TTA GCA TTC 1634 
Phe Ala Thr Asp Tyr Leu Val Gin Pro Phe Met Asp Gin Leu Ala Phe 
450 455 460 

CAT CAA TTT TAT ATC TAGCATATTT GCGGTTAGAA TCCCATGGAT GTTTCTTCTT 1689 
His Gin Phe Tyr He 
465 

TGACTATAAC CAAATCTGGG GAGGACAAAG GTGATTTTCC TGTGTCCACA TCTAACAAAG 174 9 

TCAAGATTCC CGGCTGGACT TTTGCAGCTT CCTTCCAAGT CTTCCT6ACC ACCTTGCACT 1809 

ATTGGACTTT GGAAGGAGGT GCCTATAGAA AACGATTTTG AACATACTTC ATCGCAGTGG 1869 

ACTGTGTCCC TCGGTGCAGA AACTACCAGA TTTGAGGGAC GAGGTCAAGG AGATATGATA 1929 

GGCCCGGAAG TTGCTGTGCC CCATCAGCAG CTTGACGCGT GGTCACAGGA CGATTTCACT 1989 

GACACTGCGA ACTCTCAGGA CTACCGGTTA CCAAGAGGTT AGGTGAAGTG GTTTAAACCA 204 9 

AACGGAACTC TTCATCTTAA ACTACAOGTT GAAAATCAAC CCAATAATTC TGTATTAACT 2109 

GAATTCTGAA CTTTTCAGGA GGTACTGTGA GGAAGAGCAG GCACCAGCAG CAGAATGGGG 2169 

AATGGAGAGG TGGGCAGGGG TTCCAGCTTC CCTTTGATTT TTTGCTGCAG ACTCATCCTT 2229 

TTTAAATGAG ACTTGTTTTC CCCTCTCTTT GAGTCAAGTC AAATATGTAG ATTGCCTTTG 2289 

GCAATTCTTC TTCTCAAGCA CTGACACTCA TTACCGTCTG TGATTCCCAT TTCTTCCCAA 2349 

GGCCAGTCTG AACCTGAGGT TGCTTTATCC TAAAAGTTTT AACCTCAGGT TCCAAATTCA 2409 

GTAAATTTTG GAAACAGTAC AGCTATTTCT CATCAATTCT CTATCATGTT GAAGTCAAAT 2469 

TTGGATTTTC CACCAAATTC TGAATTTGTA GACATACTTG TACGCTCACT TGCCCCCAGA 2529 

TCCCTCCTCT GTCCTCATTC TTCTCTCCCA CACAAGCAGT CTTTTTCTAC AGCCAGTAAG 2589 

GCAGCTCTGT CRTGGTAGCA GATGGTCCCA TTATTCTAGG GTCTTACTCT TTGTATGATG 2649 

AAAAGAATGT GTTATGAATC GGTGCTGTCA GCCCTGCTGT CAGACCTTCT TCCACAGCAA 2709 

ATGAGATGTA TGCCCAAAGC GGTAGAATTA AAGAAGAGTA AAATGGCTGT TGAAGC 2765 



(2 ) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 467 amino acids 
(B) TYPE: amina acid 
(D) TOPOLOGY: linear 

(ii) MOLECOLE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2; 
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Met Thr Glu Leu Pro Ala Pro Leu Ser Tyr Phe Gin Asn Ala Gin Met 
15 10 IS 

Ser Glu Asp Asn His Leu Ser Asn Thr Val Arg Ser Gin Asn Asp Asn 
20 25 30 

Arg Glu Arg Gin Glu His Asn Asp Arg Arg Ser Leu Gly His Pro Glu 
35 40 45 

Pro Leu Ser Asn Gly Arg Pro Gin Gly Asn Ser Arg Gin Val Val Glu 
50 55 60 

Gin Asp Glu Glu Glu Asp Glu Glu Leu Thr Leu Lys Tyr Gly Ala Lys 
^5 70 75 80 

His Val He Met Leu Phe Val Pro Val Thr Leu Cys Met Val Val Val 
85 90 95 

Val Ala Thr He Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp Gly Gin 
100 105 110 

Leu He Tyr Thr Pro Phe Thr Glu Asp Thr Glu Thr Val Gly Gin Arq 
115 120 125 

Ala Leu His Ser He Leu Asn Ala Ala He Met He Ser Val He Val 
130 135 140 

Val Met Thr He Leu Leu Val Val Leu Tyr Lys Tyr Arg Cys Tyr Lys 
1^5 150 155 150 

Val He His Ala Trp Leu He He Ser Ser Leu Leu Leu Leu Phe Phe 
165 170 175 

Phe Ser Phe He Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn Val Ala 
IBO 185 190 

Val Asp Tyr He Thr Val Ala Leu Leu He Trp Asn Phe Gly Val Val 
1^5 200 205 

Gly Met He Ser He His Trp Lys Gly Pro Leu Arg Leu Gin Gin Ala 
210 215 220 

Tyr Leu He Met He Ser Ala Leu Met Ala Leu Val Phe He Lys Tyr 



230 



235 



240 



Leu Pro Glu Trp Thr Ala Trp Leu He Leu Ala Val He Ser Val Tyr 
245 250 255 

Asp Leu Val Ala Val Leu Cys Pro Lys Gly Pro Leu Arg Met Leu Val 
2€0 265 270 

Glu Thr Ala Gin Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu He Tyr 
275 280 285 

Ser Ser Thr Met Val Trp Leu Val Asn Met Ala Glu Gly Asp Pro Glu 
250 295 300 

Ala Gin Arg Arg Val Ser Lys Asn Ser Lys Tyr Asn Ala Glu Ser Thr 

310 315 320 

Glu Arg Glu Ser Gin Asp Thr Val Ala Glu Asn Asp Asp Gly Gly Phe 
325 330 335 

Ser Glu Glu Trp Glu Ala Gin Arg Asp Ser His Leu Gly Pro His Arg 
340 ' ^ 



345 



350 



Ser Thr Pro Glu Ser Arg Ala Ala Val Olh Glu Leu Ser Ser Ser He 
355 360 365 

^? Gly Glu Asp Pro Glu Glu Arg Gly Val Lys Leu Gly Leu Gly 
370 375 
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Aap Phc lie Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ala Thr Ala 
385 390 395 

Ser Gly Asp Trp Aan Thr Thr He Ala Cys Phe Val Ala He Leu He 
405 410 

Gly Leu Cys Leu Thr Leu Leu Leu Leu Ala He Phe Lys Lys Ala Leu 
420 425 430 

Pro Ala Leu Pro He Ser He Thr Phe Gly Leu Val Phe Tyr Phe Ala 
435 440 445 

Thr Asp Tyr Leu Val Gin Pro Phe Met Asp Gin Leu Ala Phe His Gin 
450 455 460 

Phe Tyr He 
465 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 557.. 1945 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..30B6 

(D) OTHER INFORMATION: /note- "hPSl-2" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

GAATTCGGCA CGAGGGAAAT GCTGTTTGCT CGAAGACGTC TCAGGGCGCA GGT6CCTTGG 60 

GCCGGGATTA GTAGCCGTCT GAACTGGAGT GGAGTAGGAG AAAGAGGAAG CGTCTTGGGC 120 

TGGGTCTGCT TGAOCAACTG GTGAAACTCC GCGCCTCACG CCCCGGGTGT GTCCTTGTCC 180 

AGGGGCGACG AGCATTCTGG GCGAAGTCCG CACSCCTCTT GTTCGAGGCG GAAGACGGGG 240 

TCTGATSCTT TCTCCTTGGT CGGGMCTGTC TCGAGGCATG CATGTCCAGT GACTCTTGTG 300 

TTTGCTGCTG CTTCCCTCTC AGATTCTTCT CACCGTTGTG GTCAGCTCTG CTTTAGGCAT 360 

ATTAATCCAT AGTGGAGGCT GGGATGGGTG AGAGAATTGA GGTGACTTTT CCATAATTCA 420 

GACCTAATCT GGGAGCCTGC AAGTGACAAC AGCCTTTGCG GTCCTTAOAC AGCTTGGCCT 480 

GGAGGAGAAC ACATGAAAGA AAGAACCTCA AGAGGCTTTG TTTTCTGTGA AACAGTATTT 540 

CTATACAGTT GCTCCA ATG ACA GAG TTA CCT GCA CCG TTG TCC TAC TTC 589 
Met Thr Glu Leu Pro Ala Pro Leu Ser Tyx Phe 
1 5 io 

CAG AAT GCA CAG ATG TCT GAG GAC AAC CAC CTG AGC AAT ACT AAT GAC 637 
Gin Asn Ala Gin Met Ser Glu Aap Aan His Leu Ser Asn Thr Asn Asp 
15 26 25 

AAT AGA GAA CGG CAG GAG CAC AAC GAC AGA CGG AGC CTT GGC CAC CCT 685 
Asn Arg Glu Arg <31n Glu His Asn Asp Arg Arg Ser Leu Gly Hie Pro 
30 35 40 

GAG CCA TTA TCT AAT GGA CGA CCC CAG GGT AAC TCC CGG CAG GTG GTG 733 
Glu Pro Leu Ser Asn Gly Arg Pro Gin Gly Asn Ser Arg Gin Val Val 
45 SO 55 



<;iiH5TTnrrp ^uccrrotii c net 



973 



WO 96/34099 ^ ^ PCT/CA96/00263 

" 133 - 

GAG CAA GAT GAG GAA GAA GAT GAG GAG CTG ACA TTG AAA TAT GGC GCC 781 
Qlu Gin Asp Glu Glu Glu Asp Glu Glu Leu Thr Leu Lye Tyr Gly Ala 
€0 65 70 75 

AAG CAT GTG ATC ATG CTC TTT GTC CCT GTG ACT CTC TGC ATG GTG GTG 829 
Lys His Val He Met Leu Phe Val Pro Val Thr Leu Cys Met Val Val 
80 85 90 

GTC GTG GCT ACC ATT AAG TCA GTC AGC TTT TAT ACC CGG AAG GAT GGG 877 
Val Val Ala Thr He Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp Gly 
95 100 105 

CAG CTA ATC TAT ACC CCA TTC ACA GAA GAT ACC GAG ACT GTG GGC CAG 925 
Gin Leu He Tyr Thr Pro Phe Thr Glu Asp Thr Glu Thr Val Gly Gin 
110 115 120 

AGA GCC CTG CAC TCA ATT CTG AAT GCT GCC ATC ATG ATC AGT GTC ATT 
Arg Ala Leu His Ser He Leu Asn Ala Ala He Met He Ser Val He 
125 130 135 

GTT GTC ATG ACT ATC CTC CTG GTG GTT CTG TAT AAA TAC AGG TGC TAT 1021 

Val Val Met Thr He Leu Leu Val Val Leu Tyr Lys Tyr Arg Cys Tvr 
140 145 ISO ^ ^ r 

AAG GTC ATC CAT GCC TGG CTT ATT ATA TCA TCT CTA TTG TTG CTG TTC 1069 

Lys Val He His Ala Trp Leu He He Ser Ser Leu Leu Leu Leu Phe 
160 165 170 

TTT TTT TCA TTC ATT TAC TTG GGG GAA GTG TTT AAA ACC TAT AAC GTT 1117 
Phe Phe Ser Phe He Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn Val 
175 180 185 

GCT GTG GAC TAC ATT ACT GTT GCA CTC CTG ATC TGG AAT TTG GGT GTG 1165 
Ala Val Asp Tyr He Thr Val Ala Leu Leu He Trp Asn Leu Gly Val 
190 195 200 

GTG GGA ATG ATT TCC ATT CAC TGG AAA GGT CCA CTT CGA CTC CAG CAG 1213 
Val Gly Met He Ser He His Trp Lys Gly Pro Leu Arg Leu Gin Gin 
205 210 215 

GCA TAT CTC ATT ATG ATT AGT GCC CTC ATG GCC CTG GTG TTT ATC AAG 1261 
Ala Tyr Leu He Met He Ser Ala Leu Met Ala Leu Val Phe He Lys 
220 225 230 235 

TAC CTC CCT GAA TGG ACT GCG TGG CTC ATC TTG GCT GTG ATT TCA GTA 1309 
Tyr Leu Pro Glu Trp Thr Ala Trp Leu He Leu Ala Val He Ser Val 
240 245 250 

TAT GAT TTA GTG GCT GTT TTG TGT CCQ AAA GGT CCA CTT CGT ATG CTG 1357 
Tyr Asp Leu Val Ala Val Leu Cys Pro Lys Gly Pro Leu Arg Met Leu 
255 260 265 

GAA ACA GCT CAG GAG AGA AAT GAA ACG CTT TTT CCA GCT CTC ATT 1405 
Val Glu Thr Ala Gin Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu He 
270 275 280 

TAC TCC TCA ACA ATG GTG TGG TTG GTG AAT ATG GCA GAA GGA GAC CCG 
Tyr Ser Ser Thr Met Val Trp Leu Val Asn Met Ala Glu Gly Asp Pro 
285 290 295 

GAA GCT CAA AGG AGA GTA TCC AAA AAT TCC AAG TAT AAT GCA GAA AGC 1501 
Glu Ala Gin Arg Arg . Val Ser Lya Asn Ser Lys Tyr Asn Ala Glu Ser 
300 305 310 315 

ACA GAA AGG GAG TCA CAA GAC ACT GTT GCA GAG AAT GAT GAT GGC GGG 
Thr Glu Arg Glu Ser .Gin Asp Thr Val Ala Glu Asn Asp Asp Gly Gly 
320 325 330 

TTC AGT GAG GAA TGG GAA GCC CAG AGG GAC AGT CAT CTA GGG CCT CAT 
Phe Ser Glu Glu Trp Glu Ala Gin Arg Asp Ser His Leu Gly Pro His 
335 340 345 



1453 



1549 



1597 
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CGC TCT ACA CCT GAG TCA CGA OCT GCT GTC CAG GAA CTT TCC AGC AGT 
Arg Ser Thr Pro Glu Ser Ar^ Ala Ala Val Gin Glu Leu Ser Ser Ser 
350 355 360 

ATC CTC GCT GGT GAA GAC CCA GAG GAA AGG GGA GTA AAA CTT GGA TTG 
He Leu Ala Gly Glu Asp Pro Glu Glu Arg Gly Val Lys Leu Gly Leu 
365 370 375 

GGA GAT TTC ATT TTC TAC AGT GTT CTG GTT GGT AAA GCC TCA GCA ACA 
Gly Asp Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ala Thr 
380 385 390 395 

GCC AGT GGA GAC TGG AAC ACA ACC ATA GCC TGT TTC GTA GCC ATA TTA 
Ala Ser Gly Asp Trp Asn Thr Thr He Ala Cys Phe Val Ala He Leu 
400 405 410 

ATT GGT TTG TGC CTT ACA TTA TTA CTC CTT GCC ATT TTC AAG AAA GCA 
He Gly Leu Cys Leu Thr Leu Leu Leu Leu Ala He Phe Lys Lys Ala 
415 420 425 

TTG CCA GCT CTT CCA ATC TCC ATC ACC TTT GGG CTT GTT TTC TAC TTT 
Leu Pro Ala Leu Pro He Ser He Thr Phe Gly Leu Val Phe Tyr Phe 
430 435 440 

GCC ACA GAT TAT CTT GTA CAG CCT TTT ATG GAC CAA TTA GCA TTC CAT 
Ala Thr Asp Tyr Leu Val Gin Pro Phe Met Asp Gin Leu Ala Phe His 
445 450 455 

CAA TTT TAT ATC TAGCATATTT GCGGTTAGAA TCCCATGGAT GTTTCTTCTT 

Gin Phe Tyr He 

460 

TQACTATAAC CAAATCTGGG GAGGACAAAG GTGATTTTCC TGTGTCCACA TCTAACAAAG 
TCAAGATTCC CGGCTGGACT TTTGCAGCTT CCTTCCAAGT CTTCCTGACC ACCTTGCACT 
ATTGGACTTT GGAAGGAGGT GCCTATAGAA AACGATTTTG AACATACTTC ATCGCAGTGG 
ACTGTGTCCT CGGTGCAGAA ACTACCAGAT TTGAGGGACG AGGTCAAGGA GATATGATAG 
GCCCGGAAGT TGCTGTGCCC CATCAGCAGC TTGACGCGTG GTCACAGGAC GATTTCACTG 
ACACTGCGAA CTCTCAGGAC TACCGGTTAC CAAGAGGTTA GGTGAAGTGG TTTAAACCAA 
ACGGAACTCT TCATCTTAAA CTACACGTTG AAAATCAACC CAATAATTCT GTATTAACTG 
AATTCTGAAC TTTTCAGGAG GTACTGTGAG GAAGAGCAGG CACCAGCAGC AGAATGGGGA 
ATGGAGAGGT GGGCAGGGGT TCCAGCTTCC CTTTGATTTT TTGCTGCAGA CTCATCCTTT 
TTAAATGAGA CTTGTTTTCC CCTCTCTTTG AGTCAAGTCA AATATGTAGA TGCCTTTGGC 
AATTCTTCTT CTCAAGCACT GACACTCATT ACCGTCTGTG ATTGCCATTT CTTCCCAAGG 
CCAGTCTGAA CCTGAGGTTG CTTTATCCTA AAAGTTTTAA CCTCAGGTTC CAAATTCAGT 
AAATTTTGGA AACAGTACAG CTATTTCTCA TCAATTCTCT ATCATGTTGA AGTCAAATTT 
GGATTTTCCA CCAAATTCTG AATTTGTAGA CATACTTGTA CGCTCACTTG CCCCAGATGC 
CTCCTCTGTC CTCATTCTTC TCTCCCACAC AAGCAGTCTT TTTCTACAGC CAGTAAGGCA 
GCTCTGTCGT GGTAGCAGAT GGTCCCACTT ATTCTAGGGT CTTACTCTTT GTATGATGAA 
AAGAATGTGT TATGAATCGG TGCTGTCAGC CCTGCTGTCA GACCTTCTTC CACAGCAAAT 
GAGATGTATG CCCAAAGCGG TAGAATTAAA GAAGAGTAAA ATGGCTGTTG AAGCAAAAAA 
AAAAAAAAAA AAAAAAAAAA A 



1645 

1693 

1741 

1789 

1837 

1885 

1933 

1985 

2045 
2105 
2165 
2225 
2285 
2345 
2405 
2465 
2525 
2585 
2645 
2705 
2765 
2825 
2885 
2945 
3005 
3065 
3086 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Thr Glu Leu Pro Ala Pro Leu Ser Tyr Phe Gin Aan Ala Gin Met 
15 10 IS 

Ser Glu Asp Aan His Leu Ser Asn Thr Asn Asp Asn Arg Glu Arg Gin 
20 25 30 

Glu His Asn Asp Arg Arg Ser Leu Gly His Pro Glu Pro Leu Ser Asn 
35 40 45 

Gly Arg Pro Gin Gly Asn Ser Arg Gin Val Val Glu Gin Asp Glu Glu 
50 55 60 

Glu Asp Glu Glu Leu Thr Leu Lys Tyr Gly Ala Lys His Val lie Met 
65 70 75 80 

Leu Phe Val Pro Val Thr Leu Cys Met Val Val Val Val Ala Thr lie 
85 90 95 

Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp Gly Gin Leu lie Tyr Thr 
100 105 110 

Pro Phe Thr Glu Asp Thr Glu Thr Val Gly Gin Arg Ala Leu His Ser 
115 120 125 

He Leu Asn Ala Ala He Met He Ser Val He Val Val Met Thr He 
130 135 140 

Leu Leu Val Val Leu Tyr Lys Tyr Arg Cys Tyr Lys Val He His Ala 
145 150 155 160 

Trp Leu He He Ser Ser Leu Leu Leu Leu Phe Phe Phe Ser Phe He 
165 170 175 

Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn Val Ala Val Asp Tyr He 
180 185 190 

Thr Val Ala Leu Leu He Trp Asn Leu Gly Val Val Gly Met He Ser 
155 200 205 

He His Trp Lys Gly Pro Leu Arg Leu Gin Gin Ala Tyr Leu He Met 
210 215 220 

He Ser Ala Leu Met Ala Leu Val Phe He Lys Tyr Leu Pro Glu Tro 
225 230 235 240 

Thr Ala Trp Leu He Leu Ala Val He Ser Val Tyr Asp Leu Val Ala 
245 250 255 

Val Leu Cys Pro Lys Gly Pro Leu Arg Met Leu Val Glu Thr Ala Gin 
260 265 270 

Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu He Tyr Ser Ser Thr Met 
275 280 285 

Val Trp Leu Val Asn Met Ala Glu Gly Asp Pro Glu Ala Gin Arq Arq 
290 295 300 ^ 

Val Ser Lys Asn Ser Lys Tyr Asn Ala Glu Ser Thr Glu Arg Glu Ser 
305 310 315 320 

Gin Asp Thr Val Ala Glu Asn Asp Asp Gly Gly Phe Ser Glu Glu Trp 
325 330 335 
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Glu Ala Gin Arg Asp Ser His Leu Gly Pro His Arg Ser Thr Pro Glu 
340 345 350 

Ser Arg Ala Ala Val Gin Glu Leu Ser Ser Ser He Leu Ala Gly Glu 
355 360 365 

Asp Pro Glu Glu Arg Gly Val Lys Leu Gly Leu Gly Asp Phe He Phe 
370 375 380 

Tyr Ser Val Leu Val Gly Lys Ala Ser Ala Thr Ala Ser Gly Asp Trp 
385 390 395 400 

Asn Thr Thr He Ala Cys Phe Val Ala He Leu He Gly Leu Cys Leu 
405 410 415 

Thr Leu Leu Leu Leu Ala He Phe Lys Lys Ala Leu Pro Ala Leu Pro 
420 425 430 

He Ser He Thr Phe Gly Leu Val Phe Tyr Phe Ala Thr Asp Tyr Leu 
435 440 445 

Val Gin Pro Phe Met Asp Gin Leu Ala Phe His Gin Phe Tyr He 
450 455 460 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..2494 

(D) OTHER INFORMATION: /note- "lExln2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



AAGCTTTTGT 


GTGTAAAAAG 


TATTAGAATC 


TCATGTTTTT GAACAAGGTT GGCAGTGGGT 


60 


TGGGAGGAGG 


GATTGGAGAT 


TGATGCGATA 


GGAATGTGAA GGGATAGCTT GGGGTGGATT 


120 


TTATTTTTTA 


ATTTTAATTT 


ttatttkttg 


AGATGGAGTC TTGCTCTGTC TCCCAGGCTG 


180 


GAGTGCAGTG 


GTGTGATCTC 


AGCTCACGGG 


TTCAAGCGAT TCTCCTGCTG CAGCCTCCCG 


240 


AGTAGCTGGG 


ATTACAGGAG 


CGCGCCACCA 


CACCCGGNTA ATTTNNTTGT ATTTTTAGTA 


300 


GAGACGGGGT 


TTCACCATGT 


TGGGTTAGGC 


TGGTCTAGAA CTCCCAACCT CATGATCCGC 


360 


CTGCTTCGGC 


CTCCCAAAGT 


GCCGGAATTA 


CAGGCGTGAG CGACTGCACC CGGCCGCTTG 


420 


GGGGTGGATT 


TTTAAAGAAA 


CTTTAGAAGA 


ATGTAACTTG SCCAGATACC ATGTACCGTT 


480 


AATTTCATTT 


TCGGTTTTTK 


GAATACCCAT 


GTTTGACATT TMTCCGTTCA CCTTGATTAA 


540 


ATAAGGTAGT 


ATTCATTTTT 


TAGTTTTAGC 


TTTTGGATAT ATGTGTAAGT GTGGTATGCT 


600 


GTCtAATGAA 


TTAAGACAAT 


TGiSTNCTKTC 


TTTACCCMAM ANCTGGACMA AGAOCAGGCA 


660 


AGATGCAAAA 


ATCAAGTGAC 


CCAGCAAACC 


AGACACATTT TCTGCTCTCA GCTAGCTTGC 


720 


CACCTAGAAA 


GACTGGTTGT CAAAGTTGGA 


GTCCAAOAAT CGCGGAOOAT GTTTAAAATG 


780 


CAGTTTCTCA 


GGTTCTCNCC ACCCACCAGA 


AGTTTTGATT CATTGAGTGG TGGGAGAGGG 


840 


CAGAGATATT 


TGCGATTTTA ACAGCATTCT 


CTTOATTGTG ATGCAGCTGG TTCSCAAATA 


900 


GGTACCCTAA 


AGAAATGACA GGTGTTAAAT 


TTAGGATGGC CATOGCTTGT ATGCCGGGAG 


960 
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AAGCACACGC TGGGCCCAAT TTATATAGGG GCTTTCGTCC TCAGCTCXSAG CARCCTCAGA 1020 

ACCCCGACAA CCYACGCCAG CKCTCTGGGC GGATTCCRTC AGKTGGGGAA GSCCAGGTGG 1080 

AGCTCTGGKT TCTCCCCGCA ATCX3TTTCTC CAGGCCGGAG GCCCCGCCCC CTTCCTCCTG 1140 

GCTCCTCCCC TCCTCCGTGG GCCGNCCGCC AACGACGCCA GAGCCGGAAA TGACX5ACAAC 1200 

GGTGAGGGTT CTCGGGCGGG GCCTGGGACA GGCAGCTCCG GGGTCCX5CGG TTTTCACATC 1260 

GGAAACAAAA CAGCGGCTGG TCTGGAAGGA ACCTGAGCTA CGACCCGCGG CGGCAGCGGG 1320 

GCGGCGGGGA AGCGTATGTG CGTGATGGGG AGTCCGGGCA AGCCAGGAAG GCACCGCGGA 1380 

CATGGGCGGC CGCGGGCAGG GNCCGGNCCT TTGTGGCCGC CCGGGCCGCG AAGCCGGTGT 1440 

CCTAAAAGAT GAGGGGCGGG GCGCGGCCGG TTGGGGCTGG GGAACCCCGT GTGGGAAACC 1500 

AGGAGGGGCG GCCCGTTTCT CGGGCTTCGG GCGCGGCCGG GTGGAGAGAG ATTCCGGGGA 1560 

GCCTTGGTCC GGAAATGCTG TTTGCTCGAA GACGTCTCAG GGCGCAGGTG CCTTGGGCCG 1620 

GGATTAGTAG CCGTCTGAAC TGGAGTGGAG TAGGAGAAAG AGGAAGCGTC TTGGGCTGGG 1680 

TCTGCTTGAG CAACTGGTGA AACTCCGCGC CTCACGCCCC GGGTGTGTCC TTGTCCAGGG X740 

GCGACGAGCA TTCTGGGCGA AGTCCGCACG CCTCTTGTTC GAGGCGGAAG ACGGGGTCTT 1800 

GATGCTTTCT CCTTGGTCGG GACTGTCTCG AGGCATGCAT GTCCAGTGAC TCTTGTGTTT 1860 

GCTGCTGCTT CCCTCTCAGA TTCTTCTCAC CGTTGTGGTC AGCTCTGCTT TAGGCATATT 1920 

AATCCATAGT GGAGGCTGGG ATGGGTGAGA GAATTGAGGT GACTTTTCCA TAATTCAGGT 1980 

GAGATGTGAT TAGAGTYCGG ATCCTNCGGT GGTGGCAGAG GCTTACCAAG AAACACTAAC 2040 

GGGACATGGG AACCAATTGA GGATCCAGGG AATAAAGTGT GAAGTTQACT AGGAGGTTTT 2100 

CAGTTTAAGA ACATGGCAGA GACATTCTCA GAAATAAGGA AGTTAGGAAG AAAGACCTGG 2160 

TTTAGAGAGG AGGGCGAGGA AGTGGTTTGG AAGTGTCACT TTGGAAGTX5C CAGCAG6TGA 2220 

AAATGCCCTG TGAACAGGAC TGGAGCTGAA AACAGGAATC AATTCCATAG ATTTCCAGTT 2280 

GATGTTGGAG CAGTGGAGAA GTCTAANCTA AGGAAGGGGA AGAGGAGGCC AAGCCAAACA 2340 

CTTAGGAACA CTTNCNACGA GGGGGTGGAA GAA6AGCAAG GAGCCAGCTG AGGAGAATGA 2400 

GTGTGGTTGG AGAACCACCA CAGCNCAGGG TCGCCAGANC TGAGGAAGGG GAGGGAAGCT 2460 

TATCGAGKAM SGWCRACMKC GAGTTGGCAG GGAT 2494 

(2) INFORMATION FOR SEO ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1117 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1 . . 11X7 

(D) OTHER INFORMATION: /note« "lEx3n4." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GGATGCGCCC GCCTTGGCCT CCCAAAGTGC TGGGATTACA GGCATGAGCC ACCGCTCCTG 60 

GCTGAOTCTG CGATTTCTTG CCAGCTCTAC CCAGTTOTGT CATCTTAAGC AAGTCACTGA 120 

ACTTCTCTGG ATTCCCTTCT CCTNNWGTAA AATAAGNATG TTATCTGNCC NNCCTCCCTT ;80 
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240 
300 



460 
540 



720 
780 
840 
900 
960 



GGGCATTGTG ATAAGGATAA GATGACATTA TAGAATNTNG CAAAATTAAA AGCGCTAGAC 
AAATGATTTT ATGAAAATAT AAAGATTAGN TTGAGTTTGG GCCAGCATAG AAAAAGGAAT 
GTTGAGAACA TTCCNTTAAG GATTACTCAA GCYCCCCTTT TGSTGKNWAA TCAGANNGTC 3 60 

ATNNAMNTAT CNTNTGTGGG YTGAAAATGT TTGGTTGTCT CAGGCGGTTC CTACTTATTG 420 
CTAAAGAGTC CTACCTTGAG CTTATAGTAA ATTTGTCAGT TAGTTCAAAG TCGTGACAAA 
TTAATACATT CCTGGTTTAC AAATTGGTCT TATAAGTATT TGATTGGTNT AAATGNATTT 
ACTAGGATTT AACTAACAAT GGATGACCTG GTGAAATCCT ATTTCAGACC TAATCTGGGA 600 
GCCTGCAAGT GACAACAGCC TTTGCGGTCC TTAGACAGCT TGGCCTGGAG GAGAACACAT 660 
GAAAGAAAGG TTTGWNTCTG NTTAWTGTAA TCTATGRAAG TGTTTTTWAT MACAGTATAA 
TTGTMTGMAC AAAGTTCTGT TTTTCTTTCC CTTTNCAGAA CCTCAAGAGG CTTTGTTTTC 
TGTGAAACAG TATTTCTATA CAGTTGCTCC AATGACAGAG TTACCTGCAC CGTTGTCCTA 
CTTCCAGAAT GCACAGATGT CTGAGGACAA CCACCTGAGC AATACTOTAC GTAGCCAGGT 
ACAGCGTCAG TYTCTNAAAC TGCCTYYGNC AGACTGGATT CACTTATCAT CTCCCCTCAC 
CTCTGAGAAA TGCTGAGGGG GSTAGGNAGG GCTTTCTCTA CTTNACCACA TTTNATAATT 1020 
ATTTTTGGGT GACCTTCAGC TGATCGCTGG GAGGGACACA GGGCTTNTTT AACACATAGG 1080 
GTGTTGGATA CAGNCCCTCC CTAATTCACA TTTCANC ^117 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1727 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..1727 

(D) OTHER INFORMATION: /note« "lEx5« 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGATCCCTCC CCTTTTTAGA CCATACAAGG TAACTTCCGG ACGTTGCCAT GGCATCTGTA 60 

AACTGTCATG GTGTTGGCGG GGAGTGTCTT TTAGCATGCT AATGTATTAT AATTAGCGTA 120 

TAGTGAGCAG TGAGGATAAC CAGAGGTCAC TCTCCTCACC ATCTTGGTTT TGGTGGGTTT 180 

TGGCCAGCTT CTTTATTGCA ACCAGTTTTA TCAGCAAGAT CTTTATGAGC TGTATCTTGT 240 

GCTGACTTCC TATCTCATCC CGNAACTAAG AGTACCTAAC CTCCTGCAAA TTGMAGNCCA 300 

GNAGGTCTTG GNCTTATTTN ACCCAGCCCC TATTCAARAT AGAGTNGYTC TTGGNCCAAA 360 

CGCCYCTGAC ACAAGGATTT TAAAGTCTTA TTAATTAAGG TAAGATAGKT CCTTGSATAT 420 

GTGGTCTGAA ATOICAGAAA GCTGAATTTG GAAAAAGGTG CTTGGASCTG CAGCCAGTAA 480 

ACAAGTTTTC ATGCAGGTGT CAGTATTTAA GGTACATCTC AAAGGATAAG TACAATTGTG 540 

TATGTTGGGA TGAACAGAGA GAATGGAGCA ANCCAAGACC CAGGTAAAAG AGAGGACCTG 600 

AATGCCTTCA GTGAACAATG ATAGATAATC TAGACTTTTA AACTOCATAC TTCCTGTACA 660 

TTGTTTTTTC TTGCTTCAGG TTTTTAGAAC TCATAGTGAC GGGTCTGTTG TTAATCCCAG 720 

GTCTAACCGT TACCTTOATT CTGCTGAOAA TCTGATTTAC TQAAAATGTT TTTCTTGTGC 780 
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TTATAGAATG 


ACAATAGAGA 


ACGGCAGGAG 


CACAACGACA 


GACGGAGCCT 


TGGCCACCCT 


840 


GAGCCATTAT 


CTAATGGACG 


ACCCCAG6GT 


AACTCCCGGC 


AGGTGGTGGA 


GCAAGATGAG 


900 


GAAGAAGATG 


AGGAGCTGAC 


ATTGAAATAT 


GGCGCCAAGC 


ATGTGATCAT 


GCTCTTTGTC 


960 


CCTGTGACTC 


TCTGCATGGT 


GGTGGTCGTG 


GCTACCATTA 


AGTCAGTCAG 


CTTTTATACC 


1020 


CGGAA6GATG 


GGCAGCTGTA 


CGTATGAGTT 


TKGTTTTATT 


ATTCTCAAAS 


CCAGTGTGGC 


1080 


TTTTCTTTAC 


AGCATGTCAT 


CATCACCTTG 


AAGGCCTCTN 


CATTGAAGGG 


GCATGACTTA 


1140 


GCTGGAGAGC 


CCATCCTCTG 


TGATGGTCAG 


GAGCAGTTGA 


GAGANCGAGG 


GGTTATTACT 


1200 


TCATGTTTTA 


AGTGGAGAAA 


AGGAACACTG 


CAGAAGTATG 


TTTCCTGTAT 


GGTATTACTG 


1260 


GATAGGGCTG 


AAGTTATGCT 


GAATTGAACA 


CATAAATTCT 


TTTCCACCTC 


AGGGNCATTG 


1320 


GGCGCCCATT 


GNTCTTCTGC 


CTAGAATATT 


CTTTCCTTTN 


CTOACTTKGG 


NGGATTAAAT 


1380 


TCCTGTCATC 


CCCCTCCTCT 


TGGTGTTATA 


TATAAAGTNT 


TGGTGCCGCA 


AAAGAAGTAG 


1440 


CACTCGAATA 


TAAAATTTTC 


CTTTTAATTC 


TCAGCAAGGN 


AAGTTACTTC 


TATATAGAAG 


1500 


GGTGCACCCN 


TACAGATGGA 


ACAATGGCAA 


GCGCACATTT 


GGGACAAGGG 


AGGGGAAAGG 


1560 


GTTCTTATCC 


CTGACACACG 


TGGTCCCNGC 


TGMTGTGTKC 


TNCCCCCACT 


GANTAGGGTT 


1620 


AGACTGGACA 


GGCTTAAACT 


AATTCCAATT 


GGNTAATTTA 


AAGAGAATNA 


TGGGGTGAAT 


1680 


GCTTTGGGAG 


GAGTCAAGGA 


AGAGNAGGTA 


GNAGGTAACT 


TGAATGA 




1727 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 83 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..1883 

(D) OTHER INFORMATION: /note- «lEx6" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0i8: 

CNCGTATAAA AGACCAACAT TGCCANCNAC AACCACAGGC AAGATCTTCT CCTACCTTCC 60 

CCCNNGGTGT AATACCAAGT ATTCNCCAAT TTGTGATAAA CTTTCATTGG AAAGTGACCA 120 

CCCTCCTTGG TTAATACATT GTCTGTGCCT GCTTTCACAC TACAGTAGCA CAGTTGAGTG 180 

TTTGCCCTGG AGACCATATG ACCCATAGAG CTTAAAATAT TCAGTCTGGC TTTTTACAGA 240 

GATGTTTCTG ACTTTGTTAA TAGAAAATCA ACCCAACTGG TTTAAATAAT GCACATACTT 300 

TCTCTCTCAT AGAGTAGTGC AGAGGTAGNC AGTCCAGATT AGTASGGTGG CTTCACGTTC 360 

ATCCAAGGAC TCAATCTCCT TCTTTCTTCT TTAGCTTCTA ACCTCTAGCT TACTTCAGGG 420 

TCCAGGCTGG AGCCCTASCC TTCATTTCTG ACAGTAGGAA GGAGTAGGGG AGAAAAGAAC 480 

ATAGGACATG TCAGCAGAAT TCTCTCCTTA GAAGTTCCAT ACACAACACA - TCTCCCTAGA 540 

AGTCATTGCC CTTACTT6TT CTCATAGCCA TCCtAAATAT AAGGGAGTCA GAAOTAAAOT 600 

CTKKNTGGCT GGGAATATTG GCACCTGGAA TAAAAATGTT TTTCTGTGAA TGAjSAAACAA 660 

GGGGAAGATG GATATGTGAC ATTATCTTAA GACAACTCCA GTTGCAATTA CTCTGCAGAT 720 

GAGAOGCACT AATTATAAGC CATATTACCT TTCTTCTGAC AACCACTTOT CAGCCCMCGT 780 
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GGTTTCTGTG GCAGAATCTG GTTCYATAMC AAGTTCCTAA TAANCTGTAS CCNAAAAAAT 
TTGATGAGGT ATTATAATTA TTTCAATATA AAGCACCCAC TAGATGGAGC CAGTGTCTGC 
TTCACATGTT AAGTCCTTCT TTCCATATGT TAGACATTTT CTTTGAAGCA ATTTTAGAGT 
GTAGCTGTTT TTCTCAGGTT AAAAATTCTT AGCTAGGATT GGTGA6TTGG GGAAAAGTGA 
CTTATAAGAT NCGAATTGAA TTAAGAAAAA GAAAATTCTG TGTTGGAGGT GGTAATGTGG 
KTGGTGATCT YCATTAACAC TGANCTAGGG CTTTKGKGTT TGKTTTATTG TAGAATCTAT 
ACCCCATTCA CAGAAGATAC CGAGACTGTG GGCCAGAGAG CCCTGCACTC AATTCTGAAT 
GCTGCCATCA TGATCAGTGT CATTGTTGTC ATGACTATCC TCCTGGTGGT TCTGTATAAA 
TACAGGTGCT ATAAGGTGAG CATGAGACAC AGATCTTTGN TTTCCACCCT GTTCTTCTTA 
TGGTTGGGTA TTCTTGTCAC AGTAACTTAA CTGATCTAGG AAAGAAAAAA TGTTTT G TCT 
TCTAGAGATA AGTTAATTTT TAGTTTTCIT CCTCCTCACT GTGGAACATT CAAAAAATAC 
AAAAAGGAAG CCAGGTGCAT GTGTAATGCC AGGCTCAGAG GCTGAGGCAG GAGGATCGCT 
TGGGCCCAGG AGTTCACAAG CAGCTTGGGC AACGTAGCAA GACCCTGCCT CTATTAAAGA 
AAAGAAAAAA CAAATATTGG AAGTATTTTA TATGCATGGA ATCTATATGT CATGAAAAAA 
TTAGTGTAAA ATATATATAT TATGATTAGN TATCAAGATT TAGTGATAAT TTATGTTATT 
TTGGGATTTC AATGCCTTTT TAGGCCATTG TCTCAAMAAA TAAAAGCAGA AAAGAAAAAA 
AGTTGTAACT GAAAAATAAA CATTTCCATA TAATAGCACA ATCTAAGTGG GTTTTTGNTT 
GTTTGTTTGN TTGTTGAAGC AGGGCCTTGC CCTNYCACCC AGGNTGGAGT GAAGTGCAGT 
GGCACGATTT TGGCTCACTG GAG 
(2) INFORMATION FOR SEQ ID NO: 9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBONESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1,.823 

(D) OTHER INFORMATION: /note- "lEx7" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CAGGAGTGGA CTAGGTAAAT GNAA6NTGTT TTAAAGAGAG ATGNGGNCNG GGACATAGTG 
GTACACANCT GTAATGCTCA. NCACTKATGG GGA6TACTGA AGGNGGNSGG ATCACTTGNG 
GGTCNGGAAT NTGAGANCAG CCTGGGCAAN ATGGCGAAAC CCTGTCTCTA CTAAAAATAG 
CCANAAHNWA GCCTAGCGTG GTGGOGCRCA CGCGTOTT^ CACCTACTCA GGAGGdifTAA 
GCACGAGNAN TNCTTGAACG CAGGAGGCAG AGGNTGTGGT GARCTGAGAT CGTGCGACTG 
CACTCCAGTC TGGGCGACMA AGTGAGACCC TOTCTCCNNN AAGAAAAAAA AAATCTGTAC 
TTTTTAAGGG TTGTGOGACC TGTTAATTAT ATTGAAATGC TTCTYTTCTA GGTCATCCAT 
GCCTGGCTTA TTATATCATC TCTATTOTTO CTOTTCTTTT TTTCATTCAT TTACTTGOGG 
TAAGTTGTGA AATTTGGGGT CTOTCTTTCA OAATTAACTA CCTNNGTGCT GTGTAGCTAT 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1883 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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CATTTAAAGC CATGTACTTT GNTGATGAAT TACTCTGAAG TTTTAATTGT NTCCACATAT 
AGGTCATACT TGGTATATAA AAGACTAGNC AGTATTACTA ATTGAGACAT TCTTCTGTNG 
CTCCTNGCTT ATAATAAGTA GAACTGAAAG NAACTTAAGA CTACAGTTAA TTCTAAGCCT 
TTGGGGAAGG ATTATATAGC CTTCTAGTAG GAAGTCTTGT GCNATCAGAA TGTTTNTAAA 
GAAAGGGTNT CAAGciATNG TATAAANACC AAAAATAATT GAT 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1. .945 

(D) OTHER INFORMATION: /note« "lEx8" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTTNTCCNAA CCAACTTAGG AGNTTGGACC TGGGRAAGAC CNACNTGATC TCCGGGAGGN 
AAAGACTNCA GTTGAGCCGT GATTGCACCC ACTTTACTCC AAGCCTGGGC AACCAAAATG 
AGACACTGGC TCCAAACACA AAAACAAAAA CAAAAAAAGA GTAAATTAAT TTANAGGGAA 
GNATTAAATA AATAATAGCA CAGTTGATAT AGGTTATCGT AAAATTATAA AGGTGGGANA 
TTAATATCTA ATGTTTGGGA GCCATCACAT TATTCTAAAT AATGTTTTGG TGGAAATTAT 
TGTACATCTT TTAAAATCTG TGTAATTTTT TTTCAGGGAA GTGTTTAAAA CCTATAACGT 
TGCTGTGGAC TACATTACTG TTGCACTCCT GATCTGGAAT TTTGGTGTGG TGGGAATCAT 
TTCCATTCAC TGGAAAGGTC CACTTCGACT CCAGCAGGCA TATCTCATTA TGATTAGTGC 
CCTCATGGCC CTGGTGTTTA TCAAGTACCT CCCTGAATGG ACTGCGTGGC TCATCTTGGC 
TGTGATTTCA GTATATGGTA AAACCCAAGA CTGATAATTT GTTTGTCACA GGAATGCCCC 
ACTGGAGTGT TTTCTTTCCT CATCTCTTTA TCTTQATTTA GAGAAAATGG TAACGTGTAC 
ATCCCATAAC TCTTCAGTAA ATCATTAATT AGCTATAGTA ACTTTTTCAT TTGAAGATTT 
CGGCTGGGCA TGGTAGCTCA TGCCTGTAAT CTTAGCACTT TGGGAGGCTG AGGCGGGCAG 
ATCACCTAAG CCCAGAGTTC AAGACCAGCC TGGGCAACAT GGCAAAACCT CGTATCTACA 
GAAAATACAA AAATTAGCCG GGCATGGTGG TGCACACCTG TAGTTCCAGC TACTTAGGAG 
GCTGAGGTGO GAGGATCGAT TGATCCCAGG AGGTCAAONC TGCAG 
(2) INFORMATION FOR SEQ ID NO: 11: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear. 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 
CTGCAQCTTT CCTTTAAACT AGGAAGACTT GTTCCTATAC CCCAOTAACG ATACACTGTA 



600 
660 
720 
780 
823 



60 
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180 
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300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
945 



60 
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CACTAAGCAA ATAGCAGTCA AACCCAAATG AAATTTNTAC AGATGTTCTG TGTCATTTTA 120 

TNTTGTTTAT GTTGTCTCCC CCACCCCCAC CAGTTCACCT GCCATTTATT TCATATTCAT 180 

TCAACGTCTN NNTGTGTAAA AAGAGACAAA AAACATTAAA CTTTTTTCCT TCGTTAATTC 240 

CTCCCTACCA CCCATTTACA AGTTTAGCCC ATACATTTTA TTAGATGTCT TTTATGTTTT 300 

TCTTTTNCTA GATTTAGTGG CTGTTTTGTG TCCGAAAGGT CCACTTCGTA TGCTGGTTGA 360 

AACAGCTCAG GAGAGAAATG AAACGCTTTT TCCAGCTCTC ATTTACTCCT GTAAGTATTT 4 20 

GGAGAATGAT ATTGAATTAG TAATCAGNGT AGAATTTATC GGGAACTTGA AGANATGTNA 4 80 

CTATGGCAAT TTCANGGNAC TTGTCTCATC TTAAATGANA GNATCCCTGG ACTCCTGNAG 540 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 509 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

<AI NAME/KEY: misc^feature 

(B) LOCATION: 1..509 

(D) OTHER INFORMATION: /note« "lExlO" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CCCCGTCNAT GCATACTTTG TGTGTCCAGT GCTTACCTGG AATCCNGTCT TTCCCAACAG 60 

CAACAATGGT GTGGTTGGTG AATATGGCAG AAGGAGACCC GGAAGCTCAA AGGAGAGTAT 120 

CCAAAAATTC CAAGTATAAT GCAGAAAGTA GGTAACTYYY NTTAGATAMN ATCTTGATTT 180 

TNCAGGGTCA CTCTTATAAG CTAACAGTAT AGNAATGTTT TTATCGTCTT TCTNKGGNCA 240 

TAGACTCCTN KGAGAATCTC TTGAGAACTA TGATAATGCC CAGTAAATAC NCAGATAAGT 300 

ATTTAAGGAG TNCAGATACT CAAANCCCAA CAATACNGTC AAAGCATCCT AGGTTAAGAC 360 

AMCNCCCATT AAATACAGAA TACCAGCATG GAAAGGTTCA GGCTGAGGTT ATGATTGGGT 420 

TTGGGTTTTG GGNNNGTTTT TTATAAGTCA TGATTTTAAA AAGAAAAAAT AAACTCTCTC 480 
CAAACATGTA AAAGTAAGAA TCTCCTAAA 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1092 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



509 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: i; .1092 

(D) OTHER INFORMATION: /note« "lExll* 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
GTCTAGATAA GNCAACATTC AGGGGTAGAA GGGGACTGTT TATTTTTTCC TTTAOTCTCT 
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CTTAAAGAGT GACy^AAAATT TTCCCAGGAA TCCCGGTGGA 
GTTCATACCA AGTTACAACC CCACAACCTT AGAGCTTTTG 
ATTACCGTGC TTGGCTTGGC TTGGTCAGGA TTCACCACCA 
GGAACCCAAA CAATTCAGGA TTCTGCCCTC AGGAAATAAA 
AAACTACCAG CAGGCACTGC TACAGCCCAT GCTTTGTGGT 
ATGACAGCTA GTTACTGTTT CCATGTAATT TTCTTAAAGG 
TAGAGCTGTA ACTTCCACTT TCTCTTGAAG GCACAGAAAG 
CAGAGAATGA TGATGGCGGG TTCAGTGA6G AATGGGAAGC 
GGCCTCATCG CTCTACACCT GAGTCACGAG CTGCTGTCCA 
TCGCTGGTGA AGACCCAGAG GAAAGTATGT TCANTTCTCC 
CCTTTAGGTA GCTACATTAT CAACCTTTTT GAGAATAAAA 
CTAATTCTAT ATCACATGTA ACTTTTATTT GGATATATCA 
TTTTTTTTTT TTTTTTTTTT TTTTNGGNGA NAGAGTCTCG 
GCAATGGTGC GATCTTGGCT CACTGAAAGC TCCACCNCCC 
CCTCAGCCNC CCAAGTAGNT GGGACTACAG GGGTGCGCCA 
GGNTTTTTAG TAGAGATGGC GTTTCACCAN CTTGGNGCAG 
NATCATGATC TGCCTGCCTT AGCCTCCCCA AAGTGCTGGG 
TTCCTGGGCC TC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



CTTTGCTTCA 
TTAGGAAGAG 
GAGTCATGTG 
GGAGAAAATA 
TTAAGGGCCA 
TATTAAATTT 
GGAGTCACAA 
CCAGAGGGAC 
GGAACTTTCC 
ATNTTTCAAA 
TGAATTGAGA 
GTAATAGTGC 
CTCTGTCGCC 
GGGTTCAAGT 
CCACGCCTGG 
GCTGGTCTTG 
ATTNCAGGGG 



CCACTCATAG 
GCTTGGTQGG 
GGAGGGGGTG 
GCTGTTOGAT 
GCTAGTTACA 
TTCTAAATAT 
GACACTGTTG 
AGTCATCTAG 
AGCAGTATCC 
GTCATGGATT 
GTGTTACAGT 
TTTTTYNTTT 
AGGTTGGAGT 
GATTCTCCTG 
GATAATTTTG 
GAACTCCTGA 
TGAGCCACTG 



120 
180 
240 
300 
360 
420 
460 
540 
600 
€60 
720 
780 
840 
900 
960 
1020 

loeo 

1092 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1. .1003 

(D) OTHER INFORMATION: /note- •lExl2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 
CTGCAGTGAG CCGAGATCAT GCTGCTGTAC TCCAGCCTGG GCCACAGAGC CAAACTCCAT 
CTCCCAAAAA AAAAAAATAT TAATTAATAT GATNAAATCA TGCCTATCTC AGAATTCTTG 
TAAGGATTTC TTAGKACAAG TGCTGGGTAT AAACTATANA TTCRATAGAT GNCGATTATT 
ACTTAYTATT GTTATTGATA AATAACAGCA GCATCTACAG TTAAGACTCC AGAGTCAGTC 
ACATAGAATC TGGNACTCCT ATTGTAONAA ACCCCNHMAS AAAGAAAACA CAGCKWiAOC 
CTAATTTTGT ATATCATTTA CTGACTTCTC TCATTCATTG TGOGGTTGAG TAGGGCAGTG 
ATATTTTTGA .ATTGTGAAAT CATANCAAAG AGTGACCAAC TTTTTAATAT TTGTAACCTT 
TCCTTTTTAG GGGGAGTAAA ACTTGGATTG OGAQATTTCA TTTTCTACAG TGTTCTGGTT 
GGTAAAGCGT CAGCAACAGC CAGTGGAGAC IWAACACAA CCATAGCCTO TTTCGTAGCC 
ATATTAATTG TMMSTATACA CTAATAAGAA TGtOTCAGAG CTCTTAATOT CMAAACTTTG 



60 
120 
180 
240 
300 
360 
420 
480 
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ATTACACAGT 


CCCTTTAAGG 


CAGTTCTGTT TTAACCCCAG GTGGQTTAAA 


TATTCCAGCT 


OOU 


ATCTGAGGAG 


CTTTTNGATA 


ATTGGACCTC ACCTTAGTAG TTCTCTACCC 


TGGCCACACA 


/ « u 






TTAAAACTGT AAGCTCTGCC CTGAGATATT 


CTTACTCAAT 


780 


TTAATTGTGT 


AGTTTTTAAA 


ATTCCCCAGG AAATTCTGGT ATTTCTGTTT AGGAACCGCT 


840 


GCCTCAAGCC 


TAGCAGCACA 


GATATGTAGG AAATTAGCTC TGTAAGGTTG 


GTCTTACAGG 


900 


GATAAACAGA 


TCCTTCCTTA 


GTCCCTGGAC TTAATCACTG AGAGTTTGGG 


TGGTGGTTTT 


960 


GGATTTAATG 


ACACAACCTG 


TAGCATGCAG TGTTACTTAA GAC 




1003 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 736 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..736 

(D) OTHER INFORMATION: /note- "lExl3" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



GTCTTTCCCA 


TCTTCTCCAC 


AGGGTTTGTG CCTTACATTA TTACTCCTTG 


CCATTTTCAA 


€0 


GAAAGCATTG 


CCAGCTCTTC 


CAATCTCCAT CACCTTTGGG CTTGTTTTCT 


ACTTTGCCAC 


120 


AGATTATCTT 


GTACAGCCTT 


TTATGGACCA ATTAGCATTC CATCAATTTT 


ATATCTAGCA 


180 


TATTTGCGGT 


TAGAATCCCA 


TGGATGTTTC TTCTTTGACT ATAACAAAAT CTGGGGAGQA 


240 


CAAAGGTGAT 


TTCCTGTGTC 


CACATCTAAC AAATCAAGAT CCCOGGCTGG ACTTTTGGAG 


300 


GTTCCTTCCA 


AGTCTTCCTG 


ACCACCTTGC ACTATTGGAC TTTGGAAGGA GGTGCCTATA 


3€0 


GAAAACGATT 


TTGAACATAC 


TTCATCGCAG TGGACTGTGT CCTCGGTGCA 


GAAACTACCA 


420 


GATTTGAGGG 


ACGAGGTCAA 


GGAGATATGA TAGGCCCGGA AGTTGCTGTG 


CCCCATCAGC 


480 


AGCTTGACGC 


GTGGTCACAG 


GACGATTTTC ACTGACACTG CGAACTCTCA 


GGACTACCGT 


540 


TACCAAGAGG 


TTAGGTGAAG 


TGGTTTAAAC CAAACGGAAC TCTTCATCTT 


AAACTACACG 


600 


TTGAAAATCA 


ACCCAATAAT 


TCTGTATTAA CTGAATTCTG AACTTTTCAG 


GAGGTACTGT 


660 


GAGGAAGAGC 


AGGCACCACC 


AGCAGAATGG GGAATGGAGA GGTGGGCAGG 


GGTTCCAGCT 


720 


TCCCTTTGAT 


TTTTTG 






736 



(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 188.. 1566 
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(ix) FEATURE: 

(A) NAME/KEY: tnisc feature 

(B) LOCATION: 1 . . 1964 

(D) OTHER INFORMATION; /note« "mPSl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16; 

ACCANACANC GGCAGCTGAG GCGGAAACCT AGGCTGCGAG CCGGCCGCCC GGGCGCGGAG 60 

AGAGAAGGAA CCAACACAAG ACAGCAGCCC TTCGAGGTCT TTAGGCAGCT TGGAGGAGAA 120 

CACATGAGAG AAAGAATCCC AAGAGGTTTT GTTTTCTTTG AGAAGGTATT TCTGTCCAGC 180 

TGCTCCA ATG ACA GAG ATA CCT GCA CCT TTG TCC TAC TTC CAG AAT GCC 229 
Met Thr Glu He Pro Ala Pro I*eu Ser Tyr Phe Gin Asn Ala 
15 10 

CAG ATG TCT GAG GAC AGC CAC TCC AGC AGC GCC ATC CGG AGC CAG AAT 277 
Gin Met Ser Glu Asp Ser Hie Ser Ser Ser Ala He Arg Ser Gin Asn 
15 20 25 30 

GAC AGC CAA GAA CGG CAG CAG CAG CAT GAC AGG CAG AGA CTT GAC AAC 325 
Asp Ser Gin Glu Arg Gin Gin Gin His Asp Arg Gin Arg Leu Asp Asn 
35 40 45 

CCT GAG CCA ATA TCT AAT GGG CGG CCC CAG AGT AAC TCA AGA CAG GTG 373 
Pro Glu Pro He Ser Asn Gly Arg Pro Gin Ser Asn Ser Arg Gin Val 
50 55 60 

GTG GAA CAA GAT GAG GAG GAA GAC GAA GAG CTG ACA TTG AAA TAT GGA 421 
Val Glu Gin Asp Glu Glu Glu Asp Glu Glu Leu Thr Leu Lys Tyr Gly 
65 70 75 

GCC AAG CAT GTC ATC ATG CTC TTT GTC CCC GTG ACC CTC TGC ATG GTC 46 9 

Ala Lys His Val He Met Leu Phe Val Pro Val Thr Leu Cys Met Val 
80 85 90 

GTC GTC GTG GCC ACC ATC AAA TCA GTC AGC TTC TAT ACC CGG AAG GAC 517 
Val Val Val Ala Thr He Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp 
95 100 105 110 

GGT CAG CTA ATC TAC ACC CCA TTC ACA GAA GAC ACT GAG ACT GTA GGC 565 
Gly Gin Leu He Tyr Thr Pro Phe Thr Glu Asp Thr Glu Thr Val Gly 
115 120 125 

CAA AGA GCC CTG CAC TCG ATC CTG AAT GCG GCC ATC ATG ATC AGT GTC 613 
Gin Arg Ala Leu His Ser He Leu Asn Ala Ala He Met He Ser Val 
130 135 140 

ATT GTC ATT ATG ACC ATC CTC CTG GTG GTC CTG TAT AAA TAC AGG TGC 661 
He Val He Met Thr He Leu Leu Val Val Leu Tyr Lys Tyr Ara Cvs 
145 ISO 155 

TAC AAG GTC ATC CAC GCC TGG CTT ATT ATT TCA TCT CTG TTG TTG CTG 709 
Tyr Lys Val He His Ala Trp Leu He He Ser Ser Leu Leu Leu Leu 
160 165 170 

TTC TTT TTT TCG TTC ATT TAC TTA GGG GAA GTA TTT AAG ACC TAC AAT 757 
Phe Phe Phe Ser Phe He Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn 
175 180 185 190 

GTC GCC GTG GAC TAC GTT ACA GTA GCA CTC CTA ATC TGG AAT TTT GGT 805 
Val Ala Val Asp Tyr Val Thr Val Ala Leu Leu lie Trp Asn Phe Gly 
195 200 205 

GTG GTC GGG ATG ATT GCC ATC CAC TGG AAA GGC CCC CTT CGA CTG CAG 853 
Val Val Gly Met He Ala He His Trp Lys Gly pro Leu Arg Leu Gin 
2X0 215 220 

CAG GCG TAT CTC ATT ATG ATC AGT GCC CTC ATG GCC CTG GTA TTT ATC 901 
Gin Ala Tyr Leu He Met He Ser Ala Utu Met Ala Leu Val Phe He 
225 230 235 
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AAG TAC CTC CCC GAA TGG ACC OCA TGG CTC ATC TTG GCT GTG ATT TCA 949 
Lys Tyr Leu Pro Glu Trp Thr Ala Trp Leu lie Leu Ala Val He Ser 
240 245 250 

GTA TAT GAT TTG GTG GCT GTT TTA TGT CCC AAA GGC CCA CTT CGT ATG 997 
Val Tyr Aap Leu Val Ala Val Leu Cye Pro Lys Gly Pro Leu Arg Met 
255 260 265 270 

CTG GTT GAA ACA GCT CAG GAA AGA AAT GAG ACT CTC TTT CCA GCT CTT 1045 
Leu Val Glu Thr Ala Gin Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu 
275 280 285 

ATC TAT TCC TCA ACA ATG GTG TGG TTG GTG AAT ATG GCT GAA GGA GAC 1093 
He Tyr Ser Ser Thr Met Val Trp Leu Val Asn Met Ala Glu Gly Asp 
290 295 300 

CCA GAA GCC CAA AGG AGG GTA CCC AAG AAC CCC AAG TAT AAC ACA CAA 1141 
Pro Glu Ala Gin Arg Arg Val Pro Lys Asn Pro Lys Tyr Asn Thr Gin 
305 310 315 

AGA GCG GAG AGA GAG ACA CAG GAC AGT GGT TCT GGG AAC GAT GAT GGT 1189 
Arg Ala Glu Arg Glu Thr Gin Asp Ser Gly Ser Gly Asn Asp Asp Gly 
320 325 330 

GGC TTC AGT GAG GAG TGG GAG GCC CAA AGA GAC AGT CAC CTG GGG CCT 1237 
Gly Phe Ser Glu Glu Trp Glu Ala Gin Arg Asp Ser His Leu Gly Pro 
335 340 345 350 

CAT CGC TCC ACT CCC GAG TCA AGA GCT GCT GTC CAG GAA CTT TCT GGG 1285 
His Arg Ser Thr Pro Glu Ser Arg Ala Ala Val Gin Glu Leu Ser Gly 
355 360 365 

AGC ATT CTA ACG AGT GAA GAC CCG GAG GAA AGA GGA GTA AAA CTT GGA 1333 
Ser He Leu Thr Ser Glu Asp Pro Glu Glu Arg Gly Val Lys Leu Glv 
370 375 380 

CTG GGA GAT TTC ATT TTC TAC AGT GTT CTG GTT GGT AAG GCC TCA GCA 1381 
Leu Gly Asp Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ala 
385 390 395 

ACC GCC AGT GGA GAC TGG AAC ACA ACC ATA GCC TGC TTT GTA GCC ATA 1429 
Thr Ala Ser Gly Asp Trp Asn Thr Thr He Ala Cys Phe Val Ala He 
400 405 410 

CTG ATC GGC CTG TGC CTT ACA TTA CTC CTG CTC GCC ATT TTC AAG AAA 1477 
Leu He Gly Leu Cys Leu Thr Leu Leu Leu Leu Ala He Phe Lys Lys 
415 420 425 430 

GCG TTG CCA GCC CTC CCC ATC TCC ATC ACC TTC GGG CTC GTG TTC TAC 1525 
Ala Leu Pro Ala Leu Pro He Ser He Thr Phe Gly Leu Val Phe Tyr 
435 440 445 

TTC GCC ACG GAT TAC CTT GTG CAG CCC TTC ATG GAC CAA CTT GCA TTC 1573 
Phe Ala Thr Asp Tyr Leu Val Gin Pro Phe Met Asp Gin Leu Ala Phe 
450 455 460 

CAT CAG TTT TAT ATC TAGCCTTTCT GCAGTTAGAA CATGGATGTT TCTTCTTTGA 1628 
His Gin Phe Tyr He 
465 

TTATCAAAAA CACAAAAACA GAGAGCAAGC CCGAGGAGGA GACTGGTGAC TTTCCTGTGT 1688 

CCTCAGCTAA CAAAGGCAGG ACTCCAGCTG GACTTCTGCA GCTTCCTTCC GAGTCTCCCT 1748 

AGCCACCCQC ACTACTGGAC TGTGGAAGGA AGCGTCTACA GAGGAACGGT TTCCAACATC 1808 

CATCGCTGCA GCAGACGGTG TCCCTCAGTG ACTTGAOAGA CAAOGACAAO GAAATGTOCT 1868 

GGGCCAAGGA GCTGCCGTGC TCTGCTAGCT TTGACCGTGG GCATGGAGAT TTACCCGCAC 1928 

TGTGAACTCT CTAAGGTAAA CAAAGTGAGG TCSAACC 1954 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4S7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Thr Glu lie Pro Ala Pro Leu Ser Tyr Phe Gin Asn Ala Gin Met 
1 5 10 15 

Ser Glu Asp Ser His Ser Ser Ser Ala lie Arg Ser Gin Asn Asp Ser 
20 25 30 

Gin Glu Arg Gin Gin Gin His Asp Arg Gin Arg Leu Asp Asn Pro Glu 
35 40 45 

Pro He Ser Asn Gly Arg Pro Gin Ser Asn Ser Arg Gin Val Val Glu 
50 55 60 

Gin Asp Glu Glu Glu Asp Glu Glu Leu Thr Leu Lys Tyr Gly Ala Lys 
^5 70 75 80 

His Val He Met Leu Phe Val Pro Val Thr Leu Cys Met Val Val Val 
85 90 95 

Val Ala Thr He Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp Gly Gin 
100 105 110 

Leu He Tyr Thr Pro Phe Thr Glu Asp Thr Glu Thr Val Gly Gin Ara 
115 120 125 

Ala Leu His Ser He Leu Asn Ala Ala He Met He Ser Val He Val 
130 13S 140 

He Met Thr He Leu Leu Val Val Leu Tyr Lys Tyr Arg Cys Tyr Lys 
^*5 ISO 155 

Val He His Ala Trp Leu He He Ser Ser Leu Leu Leu Leu Phe Phe 
165 170 175 

Phe Ser Phe He Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn Val Ala 
180 1B5 190 

Val Asp Tyr Val Thr Val Ala Leu Leu He Trp Asn Phe Gly Val Val 
155 200 205 

Gly Met He Ala He His Trp Lys Gly Pro Leu Arg Leu Gin Gin Ala 
210 215 220 

Tyr Leu He Met He Ser Ala Leu Met Ala Leu Val Phe He Lys Tyr 
225 230 235 240 



Leu Pro Glu Trp Thr Ala Trp Leu He Leu Ala Val He Ser Val Tyr 
245 250 255 

Asp Leu Val Ala Val Leu Cys Pro Lys Gly Pro Leu Arg Met Leu Val 
260 265 270 

Glu Thr Ala Gin Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu lie Tyr 
275 280 285 

Wet val Trp Leu Val Asn Met Ala Glu Gly Asp Pro Glu 
295 300 

Ala Gin Arg Arg Val Pro Lys Asn Pro Lys Tyr Asn Thr Gin Arg Ala 

310 315 320 

Glu Arg Glu Thr Gin Asp Ser Gly Ser Gly Asn Asp Asp Gly Gly Phe 

325 330 ... 335 
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Ser Glu Glu Trp Glu Ala Gin Arg Asp Ser His Leu Gly Pro His Arq 
340 345 350 

Ser Thr Pro Glu Ser Arg Ala Ala Val Gin Glu Leu Ser Gly Ser lie 
355 360 365 

Leu Thr Ser Glu Asp Pro Glu Glu Arg Gly Val Lys Leu Gly Leu Glv 
370 375 380 

Asp Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ala Thr Ala 
385 390 395 400 

Ser Gly Asp Trp Asn Thr Thr He Ala Cys Phe Val Ala He Leu He 
405 410 415 

Gly Leu Cys Leu Thr Leu Leu Leu Leu Ala He Phe Lys Lys Ala Leu 
420 425 430 

Pro Ala Leu Pro He Ser He Thr Phe Gly Leu Val Phe Tyr Phe Ala 
435 440 445 

Thr Asp Tyr Leu Val Gin Pro Phe Met Asp Gin Leu Ala Phe His Gin 
450 455 460 

Phe Tyr He 
465 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2229 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 366.. 1712 

(ix) FEATURE: 

(A) NAME/KEY: mi«c_feature 

(B) LOCATION: 1..2226 

(D) OTHER INFORMATION: /note« "hPS2" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GAATTCGGCA CGAGGGCATT TCCAGCAGTG AGGAGACAGC CAGAAGCAAG CTTTTGGAGC 60 

TGAAGGAACC TGAGACAGAA GCTAGTCCCC CCTCTGAATT TTACTGATGA AGAAACTGAG 120 

GCCACAGAGC TAAAGTGACT TTTCCCAAGQ TCGCCCAGCG AGGACGTGGG ACTTCTCAGA 180 

CGTCAGGAGA GTGATGTGAG GGAGCTGTGT GACCATAGAA AGTGACGTGT TAAAAACCAG 240 

CGCTGCCCTC TTTGAAAGCC AGGGAGCATC ATTCATTTAG CCTGCTGAGA AGAAGAAACC 300 

AAGTGTCCGG GATTCAAGAC CTCTCTGCGG CCCCAAGTGT TCGTGGTGCT TCCAGAGGCA 360 

GGGCT ATG CTC ACA TTC ATG GCC TCT GAC AGC GftG GAA GAA GTG TGT 407 
Met Leu Thr Phe Met Ala Ser Asp Ser Glu Glu Glu Val Cys 

1 5 . 10 

GAT GAG CGG ACG TCC CPA ATG TCG GCC GAG AGC CCC ACG CCG CGC tCC 455 
Asp Glu Arg Thr Ser Leu Met Ser Ala Glu Ser Pro Thr Pro Arg Ser 
IS 20 25 30 

TGC CAG GAG GGC AGG CAG GGC CCA GAG GAT GGA GAG AAT ACT GCC CAG 503 
Cys Gin Glu Gly Arg Gin Gly Pro Glu Asp Gly Glu Asn Thr Ala Gin 
35 40 45 
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TGG AGA AGC CAG GAG AAC GAG GAG GAC GOT GAG GAG GAC CCT GAC C6C 
Trp Arg Ser Gin Glu Asn Glu Glu Asp Gly Glu Glu Aap Pro Asp Arg 
50 SS 60 

TAT GTC TGT AGT GGG GTT CCC GGG CGG CCG CCA GGC CTG GAG GAA GAG 
Tyr Val Cys Ser Gly Val Pro Gly Arg Pro Pro Gly Leu Glu Glu Glu 
65 70 75 

CTG ACC CTC AAA TAC GGA GCG AAG CAT GTG ATC ATG CTC TTT GTG CCT 
Leu Thr Leu Lys Tyr Gly Ala Lys His Val He Met Leu Phe Val Pro 
80 eS 90 

GTC ACT CTG TGC ATG ATC GTG GTG GTA GCC ACC ATC AAG TCT GTG CGC 
Val Thr Leu Cys Met He Val Val Val Ala Thr He Lys Ser Val Arg 
95 100 105 iiS 

t?^ AAG AAT GGA ORG CTC ATC TAC ACG CCA TTC ACT GAG 
Phe Tyr Thr Glu Lys Asn Gly Gin Leu He Tyr Thr Pro Phe Thr Glu 
115 120 125 

S.^^ CTC CTC AAC TCC GTG CTG AAC ACC 

Asp Thr Pro Ser Val Gly Gin Arg Leu Leu Asn Ser Val Leu Asn Thr 

135 140 

CTC ATC ATG ATC AGC GTC ATC GTG GTT ATG ACC ATC TTC TTG GTG GTG 
Leu He Met He Ser Val He Val Val Met Thr He Phe Leu Val Val 
l«5 150 J55 

CTC TAC AAG TAC CGC TGC TAC AAG TTC ATC CAT GGC TGG TTO ATC ATG 
Leu Tyr Lys Tyr Arg Cys Tyr Lys Phe He His Gly Tr? He 

165 

ler I« SS mJ? 7^ ^"^C TAC CTT GGG GAA 

Ser Ser Leu Met Leu Leu Phe Leu Phe Thr Tyr He Tyr Leu Gly Glu 

180 185 

GTG CTC AAG ACC TAC AAT GTG GCC ATG GAC TAC CCC ACC ere rrr rws 
val Leu Lys Thr Tyr Asn Val Ala Met ™r Sro fS SS ™ 2S 
1*5 200 205 

ACT GTC TOG AAC TTC GGG GCA GTG GGC ATG GTG TGC ATC CAC TGG AAG 
Thr val Trp Asn Phe Gly Ala Val Gly Met Val Cys He His ?S IJS 

GGC CCT CTG GTG CTG CAG CAG GCC TAC CTC ATC ATG ATC AGT GOG CTC 
Gly Pro Leu Val Leu Gin Gin Ala Tyr Leu He SJ? ill t« IJS SS 

230 235 

SJ? SS Si 51? SS i?S «^ GCG TGG GTC 

Met Ala Leu Val Phe He Lys Tyr Leu Pro Glu Trp Ser Ala Trp Val 

245 250 

ATC CTG GGC GCC ATC TCT GTG TAT GAT CTC GTC OCT GTG CTO TOT CCC 
He Leu Gly Ala He Ser Val Tyr Asp Leu Val Ala vS SS SS 

260 265 270 

AAA GGG CCT CTC AGA ATG CTG GTA GAA ACT GCC CAG GAG AGA AAT GAG 
Lys Gly Pro Leu Arg Met Leu Val Glu Thr Ala Gin Glu Arg JiJ §fS 
275 280 285 

tT* 11^ S"" CTG ATA TAC TCA TCT GCC ATC GTC TGG ACG GTT 

Pro He Phe Pro Ala Leu He Tyr Ser Ser Ala Met Val Trp Thr vS 

.295 300 

GGC ATC GCG AAG CTC GAC CCC TCC TCT CAG GOT GCC CTC CAG CTC crr 
Oly Met Ala Lys L*„ Asp Pro Ser- Ser Oin. Gly SS SS SS SS £S 

310 21S 

<^C TCC TAT GAC AGT TTT GGG GAG CCT 
Tyr Asp Pro Glu Met Glu Glu Asp Ser Tyr Asp Ser Phe Oly SS iSJ 

325 



551 



599 



647 



695 



743 



791 



839 



887 



935 



983 



1031 



1079 



1127 



1175 



1223 



1271 



1319 



1367 
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TCA TAC CCC GAA GTC TTT GAG CCT CCC TTG ACT GGC TAC CCA GGG GAG 1415 
Ser Tyr Pro Glu Val Phe Glu Pro Pro Leu Thr Gly Tyr Pro Gly Glu 
335 340 345 350 

GAG CTG GAG GAA GAG GAG GAA AGG GGC GTG AAG CTT GGC CTC GGG GAC 1463 
Glu Leu Glu Glu Glu Glu Glu Arg Gly Val Lys Leu Gly Leu Gly Asp 
355 360 365 

TTC ATC TTC TAC AGT GTG CTG GTG GGC AAG GCG GCT GCC ACG GGC AGC 1511 
Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ala Ala Thr Gly Ser 
370 375 380 

GGG GAC TGG AAT ACC ACG CTG GCC TGC TTC GTG GCC ATC CTC ATT GGC 1559 
Gly Asp Trp Asn Thr Thr Leu Ala Cys Phe Val Ala He Leu He Gly 
385 390 395 

TTG TGT CTG ACC CTC CTG CTG CTT GCT GTG TTC AAG AAG GCG CTG CCC 1607 
Leu Cys Leu Thr Leu Leu Leu Leu Ala Val Phe Lys Lys Ala Leu Pro 
400 405 410 

GCC CTC CCC ATC TCC ATC ACG TTC GGG CTC ATC TTT TAC TTC TCC ACG 1655 
Ala Leu Pro He Ser He Thr Phe Gly Leu He Phe Tyr Phe Ser Thr 
415 420 425 430 

GAC AAC CTG GTG CGG CCG TTC ATG GAC ACC CTG GCC TCC CAT CAG CTC 1703 
Asp Asn Leu Val Arg Pro Phe Met Asp Thr Leu Ala Ser His Gin Leu 
435 440 445 

TAC ATC TGA GGGACATGGT GTGCCACAGG CTGCAAGCTG CAGGGAATTT 1752 
Tyr He ♦ 

TCATTGGATG CAGTTGTATA GTTTTACACT CTAGTGCCAT ATATTTTTAA GACTTTTCTT 1812 

TCCTTAAAAA ATAAAGTACG TGTTTACTTG GTGAGGAGGA GGCAGAACCA GCTCTTTGGT 1872 

GCCAGCTGTT TCATCACCAG ACTTTGGCTC CCGCTTTGGG GAGCGCCTCG CTTCACGGAC 1932 

AGGAAGCACA GCAGGTTTAT CCAGATGAAC TGAGAAGGTC AGATTAGGGT GGGGAGAAGA 1992 

GCATCCGGCA TGAGGGCTGA GATGCCCAAA GAGTGTGCTC GGGAGTGGCC CCTCGCACCT 2052 

GGGTGCTCTG GCTGGAGAGG AAAAGCCAGT TCCCTACGAG GAGTGTTCCC AATGCTTTGT 2112 

CCATGATGTC CTTGTTATTT TATTNCCYTT ANAAACTGAN TCCTNTTNTT NTTDCGGCAG 2172 

TCACMCTNCT GGGRAGTGGC TTAATAGTAA NATCAATAAA NAGNTGAGTC CTNTTAG 2229 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Leu Thr Phe Met Ala Ser Asp Ser Glu Glu Glu Val Cys Asp Glu 
1 5 10 15 

Arg Thr Ser Leu Met Ser Ala Glu Ser Pro Thr Pro Arg Ser Cys Gin 
20 25 30 

Glu Gly Arg Gin Gly Pro Glu Asp Gly Glu Asn Thr Ala Gin Trp Arg 
35 40 45 

Ser Gin Glu Asn Glu Glu Asp Gly Glu Glu Asp Pro Asp Arg Tyr Val 
50 55 60 
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Cys Ser Gly Val Pro Gly Arg Pro Pro Gly Leu Glu Glu Glu Leu Thr 
65 70 75 BO 

Leu Lys Tyr Gly Ala Lys His Val lie Met Leu Phe Val Pro Val Thr 
85 90 95 

Leu Cys Met lie Val Val Val Ala Thr lie Lys Ser Val Arg Phe Tyr 
100 105 110 

Thr Glu Lys Asn Gly Gin Leu He Tyr Thr Pro Phe Thr Glu Asp Thr 
115 120 125 

Pro Ser Val Gly Gin Arg Leu Leu Asn Ser Val Leu Asn Thr Leu He 
130 135 140 

Met He Ser Val He Val Val Met Thr He Phe Leu Val Val Leu Tyr 
145 150 155 160 

Lys Tyr Arg Cys Tyr Lys Phe He His Gly Trp Leu He Met Ser Ser 
165 170 175 

Leu Met Leu Leu Phe Leu Phe Thr Tyr He Tyr Leu Gly Glu Val X-eu 
180 IBS 190 

Lys Thr Tyr Asn Val Ala Met Asp Tyr Pro Thr Leu Leu Leu Thr Val 
195 200 205 

Trp Asn Phe Gly Ala Val Gly Met Val Cys He His Trp Lys Gly Pro 
210 215 220 

Leu Val Leu Gin Gin Ala Tyr Leu He Met He Ser Ala Leu Met Ala 
225 230 235 240 

Leu Val Phe He Lys Tyr Leu Pro Glu Trp Ser Ala Trp Val He Leu 
245 250 255 

Gly Ala He Ser Val Tyr Asp Leu Val Ala Val Leu Cys Pro Lys Gly 
260 265 270 

Pro Leu Arg Met Leu Val Glu Thr Ala Gin Glu Arg Asn Glu Pro He 
275 280 285 

Phe Pro Ala Leu He Tyr Ser Ser Ala Met Val Trp Thr Val Gly Met 
290 295 300 

Ala Lys Leu Asp Pro Ser Ser Gin Gly Ala Leu Gin Leu Pro Tyr Asp 
305 310 315 320 

Pro Glu Met Glu Glu Asp Ser Tyr Asp Ser Phe Gly Glu Pro Ser Tyr 
325 330 335 

Pro Glu Val Phe Glu Pro Pro Leu Thr Gly Tyr Pro Gly Glu Glu Leu 
340 345 350 

Glu Glu Glu Glu Glu Arg Gly Val Lys Leu Gly Leu Gly Asp Phe He 
355 360 365 

Phe Tyr Ser Val Leu Val Gly Lys Ala Ala Ala Thr Gly Ser Gly Asp 
370 375 380 



Trp Asn Thr Thr Leu Ala Cys Phe Val Ala He Leu He Gly Leu Cys 

385 390 395 . 400 

Leu Thr Leu Leu Leu Leii Ala Val Phe Lys Lys Ala Leu Pro Ala Leu 

405 410 415 

Pro He Ser He Thr Phe Gly Leu He Phe Tyr Phe Ser Thr Asp Asn 

420 425 430 

Leii Val Arg Pro Phe Met Asp Thr Leu Ala Ser His Gin Leu Tyr He 

435 440 445 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQXTCNCE CHARACTERISTICS: 

(A) LENGTH: 1895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 140.. 1762 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1 . . 1895 

(D) OTHER INFORMATION: /note- "DmPS" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

TATATGAGTC GCTTTAAAAC AAAAGAAAGT TTTTACCAGC TACATTCCTT TGGTTTCCTT 60 

AACTAAATCC CATCACACAA CTACGGCTTC GCAGGGGGAG GCGTCCAGCG CTACGGAGGC 120 

GAACGAACGC ACACCACTG ATG GCT GCT GTC AAT CTC CAG GCT TCG TGC TCC 172 

Met Ala Ala Val Asn Leu Gin Ala Ser Cys Ser 
1 5 10 

TCC GGG CTC GCC TCT GAG GAT GAC GCC AAT GTG GGC AGC CAG ATA GGC 220 
Ser Gly Leu Ala Ser Glu Asp Asp Ala Asn Val Gly Ser Gin He Glv 
15 20 25 

GCG GCG GAG CGT TTG GAA CGA CCT CCA AGG CGG CAA CAG CAG CGG AAC 268 
Ala Ala Glu Arg Leu Glu Arg Pro Pro Arg Arg Gin Gin Gin Arg Asn 
30 35 40 

AAC TAC GGC TCC AGC AAT CAG GAT CAA CCG GAT GCT GCC ATA CTT GCT 316 
Aan Tyr Gly Ser Ser Asn Gin Asp Gin Pro Asp Ala Ala He Leu Ala 
45 50 55 

GTG CCC AAT GTG GTG ATG CGT GAA CCT TGT GGC TCG CGC CCT TCA AGA 364 
Val Pro Asn Val Val Met Arg Glu Pro Cys Gly Ser Arg Pro Ser Arc? 
€0 65 70 75 

CTG ACC GGT GGA GGA GGC GGC AGT GGT GGT CCG CCC ACA AAT GAA ATG 412 
Leu Thr Gly Gly Gly Gly Gly Ser Gly Gly Pro Pro Thr Asn Glu Met 
80 85 90 

GAG GAA GAG CAG GGC CTG AAA TAC GGG GCC CAG CAT GTG ATC AAG TTA 460 
Glu Glu Glu Gin Gly Leu Lys Tyr Gly Ala Gin His Val He Lys Leu 
95 100 105 

TTC GTC CCC GTC TCC CTT TGC ATG CTG GTA GTG GTG GCT ACC ATC AAC 508 
Phe Val Pro Val Ser Leu Cys Met Leu Val Val Val Ala Thr He Asn 
110 115 120 

TCC ATC AGC TTC TAC AAC AGC ACG GAT GTC TAT CTC CTC TAC ACA CCT 556 
Ser He Ser Phe Tyr Asn Ser Thr Asp Val Tyr Leu Leu Tyr Thr Pro 
125 130 135 

TTC CAT GAA CAA TCG CCC GAG CCT AGT GTT AAG TTC TGG AGT GCC TTG 604 
Phe His Glu Gin Ser Pro Glu Pro Ser Val Lys Phe Trp Ser Ala Leu 
140 • 145 . 150 155 

GCG AAC TCC CTG ATC CTG ATG AGC GTG GTG GTG GTG ATG ACC TTT TTG 652 
Ala Asn Ser Leu He -Leu Met Ser Val Val Val Val Met Thr Phe Leu 
160 165 170 

CTG ATT GTT TTG TAC AAG AAG CGT TGC TAT CGC ATC ATT CAC GGC TGG 700 
Leu He Val Leu Tyr Lys Lys Arg Cys Tyr Arg He He His Gly Trp 
175 180 185 
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CTG ATT CTC TCC TCC TTC ATG TTG TTG TTC ATT TTT ACQ TAG TTA TAT 748 
Leu lie Leu Ser Ser Phe Met Leu Leu Phe He Phe Thr Tyx Leu Tyr 
190 195 200 

TTG GAA GAG CTT CTT CGC GCC TAT AAC ATA CCG ATG GAG TAG CCT ACT 796 
Leu Glu Glu Leu Leu Arg Ala Tyr Asn He Pro Met Asp Tyr Pro Thr 
205 210 215 

GCA CTA CTG ATT ATG TGG AAC TTT GGA GTG GTC GGA ATG ATG TCC ATC 844 
Ala Leu Leu He Met Trp Asn Phe Gly Val Val Gly Met Met Ser He 
220 225 230 235 

CAT TGG CAG GGA CCT CTG CGG TTG GAG CAA GGA TAT CTC ATT TTC GTG 892 
His Trp Gin Gly Pro Leu Arg liCU Gin Gin Gly Tyr Leu He Phe Val 
240 245 250 

GCA GCC TTG ATG GCC TTG GTG TTC ATT AAA TAC CTG CCT GAA TGG ACT 940 
Ala Ala Leu Met Ala Leu Val Phe He Lys Tyr Leu Pro Glu Trp Thr 
255 260 265 

GCC TGG GCT GTA TTG GCT GCC ATT TCT ATT TGG GAT CTT ATT GCT GTC 988 
Ala Trp Ala Val Leu Ala Ala He Ser He Trp Asp Leu He Ala Val 
270 275 280 

CTT TCG CCA AGA GGA CCC CTC CGC ATT CTG GTG GAA ACG GCT CAG GAG 1036 
Leu Ser Pro Arg Gly Pro Leu Arg He Leu Val Glu Thr Ala Gin Glu 
285 290 295 

CGA AAT GAG CAA ATC TTC CCC GCT CTG ATT TAT TCA TCC ACT GTC GTT 1084 
Arg Asn Glu Gin He Phe Pro Ala Leu He Tyr Ser Ser Thr Val Val 
300 305 310 315 

TAC GCA CTT GTA AAC ACT GTT ACG CCG CAG CAA TCG CAG GCC ACA GCT 1132 
Tyr Ala Leu Val Asn Thr Val Thr Pro Gin Gin Ser Gin Ala Thr Ala 
320 325 330 

TCC TCC TCG CCG TCG TCC AGC AAC TCC ACC ACA ACC ACG AGG GCC ACG 1180 
Ser Ser Ser Pro Ser Ser Ser Asn Ser Thr Thr Thr Thr Arg Ala Thr 
335 340 345 

CAG AAC TCG CTG GCT TCG CCA GAG GCA GCA GCG GCT AGT GGC CAA CGC 1228 
Gin Asn Ser Leu Ala Ser Pro Glu Ala Ala Ala Ala Ser Gly Gin Arg 
350 355 360 

ACA GGT AAC TCC CAT CCT CGA CAG AAT CAG CGG GAT GAG GGC AGT GTA 1276 
Thr Gly Asn Ser His Pro Arg Gin Asn Gin Arg Asp Asp Gly Ser Val 
365 370 375 

CTG GCA ACT GAA GGT ATG CCA CTT GTG ACT TTT AAA AGC AAT TTG CGC 1324 
I/eu Ala Thr Glu Gly Met Pro Leu Val Thr Phe Lys Ser Asn Leu Arg 
380 385 390 395 

GGA AAC GCT GAG GCT GCG GGT TTC ACG CAA GAG TGG TCA GCT AAC TTG 1372 
Gly Asn Ala Glu Ala Ala Gly Phe Thr Gin Glu Trp Ser Ala Asn Leu 
400 405 410 

AGC GAA CGT GTG GCT CGT CGC CAG ATT GAA GTT CAA AGT ACT CAG AGT 1420 
Ser Glu Arg Val Ala Arg Arg Gin He Glu Val Gin Ser Thr Gin Ser 
415 420 425 

GGA AAC GCT CAG CGC TCC AAC GAG TAT AGG ACA GTA ACA GCT CCG GAT 1468 
Gly Asn Ala Gin Arg Ser Asn Glu Tyr Arg Thr Val Thr Ala Pro Asp 
430 435 440 

CAG AAT CAT CCG GAT GGG .CAA GAA GAA CGT GGC ATA AAG CTT GGC CTC 15X6 
Gin Asn His Pro Asp Gly Gin Glu Olu Arg Gly lle Lys Leu Gly Leu 
445 450 455 

GGC GAC TTC ATC TTC TAC TCG GTA TTA GTG GGC AAG GCC TCC AGC TAC 1564 
Gly Asp Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ser Tyr 
460 465 470 475 
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GGC GAC TGG ACG ACC ACA ATC OCT TGC TTT GTG GCC ATC CTC ATT GGA 1612 
Gly Asp Trp Thr Thr Thr He Ala Cya Phe Val Ala He Leu He Gly 
480 485 490 

CTC TGC CTC ACT CTT CTG CTT CTG GCC ATT TGG CGC AAG GCG CTA CCC 1660 
Leu Cys Leu Thr Leu Leu Leu Leu Ala He Trp Arg Lya Ala Leu Pro 
495 500 505 

GCC CTG CCC ATC TCA ATA ACG TTC GGA TTG ATA TTT TGC TTC GCC ACT 1708 
Ala Leu Pro He Ser He Thr Phe Gly Leu He Phe Cys Phe Ala Thr 
510 515 520 

AGT GCG GTG GTC AAG CCG TTC ATG GAG GAT CTA TCG GCC AAG CAG GTG 1756 
Ser Ala Val Val Lys Pro Phe Met Glu Asp Leu Ser Ala Lys Gin Val 
525 530 535 

TTT ATA TAAACTTGAA AAGACAAGGA CACATCAAGT GTCTTACAGT ATCATAGTCT iai2 

Phe He 

540 

AACAAAGCTT TTTGTAATCC AATTCTTTAT TTAACCAAAT GCATAGTAAC AACCTCGACT 1872 
AAAAAAAAAA AAAAAAAAAA AAA 



(2) INFORMATION FOR SEQ ID NO: 21: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Ala Ala Val Aan Leu Gin Ala Ser Cys Ser Ser Gly Leu Ala Ser 
15 10 15 

Glu Asp Asp Ala Asn Val Gly Ser Gin He Gly Ala Ala Glu Arg Leu 
20 25 30 

Glu Arg Pro Pro Arg Arg Gin Gin Gin Arg Asn Aan Tyr Gly Ser Ser 
35 40 45 

Asn Gin Asp Gin Pro Asp Ala Ala He Leu Ala Val Pro Asn Val Val 
50 55 60 

Met Arg Glu Pro Cys Gly Ser Arg Pro Ser Arg Leu Thr Gly Gly Gly 
^5 70 75 BO 

Gly Gly Ser Gly Gly Pro Pro Thr Asn Glu Met Glu Glu Glu Gin Gly 
85 90 95 

Leu Lys Tyr Gly Ala Gin His Val He Lys Leu Phe Val Pro Val Ser 
100 105 110 

Leu Cys Met Leu Val Val Val Ala Thr He Asn Ser He Ser Phe Tyr 
115 120 125 

Asn Ser Thr Asp Val Tyr Leu Leu Tyr Thr Pro Phe His Glu Gin Ser 
. 130 135 140 

Pro Glu Pro Ser Val Lys Phe Trp Ser Ala Leu Ala Asn Ser Leu He 
1*5 150 155 160 



Leu Met Ser Val Val Val Val Met Thr Phe Leu Leu He Val Leu Tyr 
165 170 175 

Lys Lys Arg Cys Tyr Arg He He His Gly Trp Leu He Leu Ser Ser 
180 185 190 



1895 



^IJR5TmiTP CUPCTfOlit c-7C\ 



wo 96/34099 




PCT/CA96/00263 



- 155 - 



Phe Met Leu Leu Phe lie Phe Thr Tyr Leu Tyr I*eu Glu Glu Leu Leu 
195 200 205 

Arg Ala Tyr Asn lie Pro Met Asp Tyr Pro Thr Ala Leu Leu He Met 
210 215 220 

Trp Asn Phe Gly Val Val Gly Met Met Ser He His Trp Gin Gly Pro 
225 230 235 240 

Leu Arg Leu Gin Gin Gly Tyr Leu He Phe Val Ala Ala Leu Met Ala 
245 250 255 

Leu Val Phe He Lys Tyr Leu Pro Glu Trp Thr Ala Trp Ala Val Leu 
260 265 270 

Ala Ala He Ser He Trp Asp Leu He Ala Val Leu Ser Pro Arg Gly 
275 2B0 285 

Pro Leu Arg He Leu Val Glu Thr Ala Gin Glu Arg Asn Glu Gin He 
290 295 300 

Phe Pro Ala Leu He Tyr Ser Ser Thr Val Val Tyr Ala Leu Val Asn 
305 310 315 320 

Thr Val Thr Pro Gin Gin Ser Gin Ala Thr Ala Ser Ser Ser Pro Ser 
325 330 335 

Ser Ser Asn Ser Thr Thr Thr Thr Arg Ala Thr Gin Asn Ser Leu Ala 
340 345 350 

Ser Pro Glu Ala Ala Ala Ala Ser Gly Gin Arg Thr Gly Asn Ser His 
355 360 365 

Pro Arg Gin Asn Gin Arg Asp Asp Gly Ser Val Leu Ala Thr Glu Gly 
370 375 380 

Met Pro Leu Val Thr Phe Lys Ser Asn Leu Arg Gly Asn Ala Glu Ala 
385 390 395 400 

Ala Gly Phe Thr Gin Glu Trp Ser Ala Asn Leu Ser Glu Arg Val Ala 
405 410 415 

Arg Arg Gin He Glu Val Gin Ser Thr Gin Ser Gly Asn Ala Gin Arg 
420 425 430 

Ser Asn Glu Tyr Arg Thr Val Thr Ala Pro Asp Gin Asn His Pro Asp 
435 440 445 

Gly Gin Glu Glu Arg Gly He Lys Leu Gly Leu Gly Asp Phe He Phe 
450 455 460 

Tyr Ser Val Leu Val Gly Lys Ala Ser Ser Tyr Gly Asp Trp Thr Thr 

470 475 480 

Thr He Ala Cys Phe Val Ala He Leu He Gly Leu Cys Leu Thr Leu 
485 490 495 

Leu Leu Leu Ala He Trp Arg Lys Ala Leu Pro Ala Leu Pro He Ser 
500 505 510 

He Thr Phe Gly Leu He Phe Cys Phe Ala Thr Ser Ala Val Val Lys 
515 520 525 

Pro Phe Met Glu Asp Leu Ser Ala Lys Gin Val Phe He 
530 535 540 

(2) INFORMATION FOR SEQ ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CTNCCNGART GGACNGYCTG G 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
RCANGCDATN GTNGTRTTCC A 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TTTTTTCTCG AGACNGCNCA RGARAGAAAY GA 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
TTTTTTGGAT CCTARAADAT RAARTCNCC 
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What is claimed is: 

1. An isolated nucleic acid comprising a nucleotide sequence 
encoding a protein selected from the group consisting of a normal 
presenilin-1 protein, a mutant presenilin-l protein, a normal 
presenilin-2 protein, euid a mutant presenilin-2 protein. 

2. An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a normal presenilin-i protein and wherein said 
nucleotide sequence is selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-1 amino acid sequence of SEQ ID NO: 2; 

(2) a sequence encoding a protein comprising the human 
presenilin-l amino acid sequence of SEQ ID NO: 4; 

(3) a sequence encoding a protein comprising the murine 
presenilin-l amino acid 8ec[uence of SEQ ID NO: 17 

(4) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 2 wherein residue 257 is replaced by 
alemine and residues 258-290 are omitted; 

(5) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 4 wherein residue 253 is replaced by 
alanine and residues 254-286 are omitted; cuid 

(6) a sequence encoding a normal presenilin-l protein and 
capable of hybridizing to a sequence complementary to any 
sequence of (1) - (5) under stringent hybridization conditions, 
3- An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a mutcuit presenilin-l protein, 

wherein said nucleotide sequence encodes at least one 
mutation which corresponds to a mutation of SEQ ID NO: 2 selected 
from the group consisting of A79?, V82L,V96F, Y115H, M139T, 
M139V, I143T, M146L, M146V, H163R, H163Y, L171P, G209V, I211T, 
A231T, A246E, A260V, C263R, P264L, P267S, E280A, E280G, A285V, 
L286V, A291-319, G384A, L392V and C410y; and 

wherein said nucleotide sequence otherwise corresponds to a 
nucleotide sequence selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-l amino acid sequence of SEQ ID NO: 2; 

(2) a sequence encoding a protein comprising the human 
presenilin-l amino acid sequence of SEQ ID NO: 4; 
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(3) a sequence encoding a protein comprising the murine 
presenilin-1 amino acid sequence of SEQ ID NO: 17; 

(4) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 2 wherein residue 257 is replaced by 
alanine and residues 258-290 are omitted; 

(5) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 4 wherein residue 253 is replaced by 
alanine and residues 254-286 are omitted; and 

(6) a sequence encoding a normal presenilin-1 protein emd 
capable of hybridizing to a sequence complementary to any 
sequence of (1) - (5) under stringent hybridization conditions. 

4. An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a mutant presenilin-1 protein, 

wherein said nucleotide sequence encodes at least one 
mutation which corresponds to a mutation of SEQ ID NO: 19 
selected from the group consisting of M239V, N141I and I420T; and 

wherein said nucleotide sequence otherwise corresponds to a 
nucleotide sequence selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-1 amino acid sequence of SEQ ID NO: 2; 

(2) a sequence encoding a protein comprising the human 
presenilin-1 amino acid sequence of SEQ ID NO: 4; 

(3) a sequence encoding a protein comprising the murine 
presenilin-1 amino acid sequence of SEQ ID NO: 17; 

(4) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 2 wherein residue 257 is replaced by 
alanine and residues 258-290 are omitted; 

(5) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 4 wherein residue 253 is replaced by 
alanine and residues 254-286 are omitted; and 

(6) a sequence encoding a normal presenilin-1 protein and 
capable of hybridizing to a sequence complementary to any 
sequence of (1) - (5) under stringent hybridization conditions. 

5. An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a normal presenilin-2 protein and wherein said 
nucleotide sequence is selected from the group consisting of 

(Da sequence encoding a protisin comprising the human 
presenilin-2 amino acid sequence of SEQ ID NO: 19; 
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(2) a sec[uence encoding a protein comprising the human 
presenilin-2 amino acid sequence of SEQ ID NO: 19 wherein 
residues 263-296 are omitted; and 

(3) a sequence encoding a normal presenilin-2 protein and 
capable of hybridizing to a sequence complementary to any one of 
sequences (1) - (2) vinder stringent hybridization conditions. 

6, An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a mutant presenilin-2 protein, 

wherein said nucleotide sequence encodes at least one 
mutation which corresponds to a mutation of SEQ ID NO: 19 
selected from the group consisting of M239V, N141I and I420T; and 

wherein said nucleotide sequence otherwise corresponds to a 
nucleotide sequence selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-2 amino acid sequence of SEQ ID NO: 19; 

(2) a sequence encoding a protein comprising the human 
presenilin-2 amino acid of sequence of SEQ ID NO: 19 wherein 
residues 263-296 are omitted; and 

(3) a sequence encoding a normal presenilin-2 protein and 
capable of hybridizing to a sequence complementary to any 
sequence of (1) - (2) under stringent hybridization conditions. 

7. An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a mutamt presenilin-2 protein, 

wherein said nucleotide sequence encodes at least one 
mutation which corresponds to a mutation of SEQ ID NO: 2 selected 
from the group consisting of A79?, V82L,V96F, Y115H, M139T, 
M139V, I143T, M146L, M146V, H163R, H163Y, L171P, G209V, I211T, 
A231T, A246E, A260V, C263R, P264L, P267S, E280A, E280G, A285V, 
L286V, A291-319, G384A, L392V and C410Y; and 

wherein said nucleotide sequence otherwise corresponds to a 
nucleotide sequence selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-2 amino acid sequence of SEQ ID NO: 19; 

(2) a sequence encoding a protein comprising the human 
presenilin-2 amino acid of sequence of SEQ ID NO: 19 wherein 
residues 263-296 are omitted; and 

(3) a sequence encoding a normal presenilin-2 protein Sand 
capable of hybridizing to a sequence complementary to any 
sequence of (1) - (2) under stringent hybridization conditions. 
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8 . An isolated nucleic acid comprising a nucleotide secjuence of 
at least 10 consecutive nucleotides selected from the group 
consisting SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 16, SEQ ID NO: 
18, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ 
ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 
13, SEQ ID NO: 14, SEQ ID NO: 15, and a sequence complementary to 
any of these sequences, 

9. An isolated nucleic acid comprising a nucleotide secjuence of 
at least 15 consecutive nucleotides selected from the group 
consisting SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 16, SEQ ID NO: 
18, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ 
ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 
13, SEQ ID NO: 14, SEQ ID NO: 15, and a sequence complementary to 
any of these sequences. 

10. An isolated nucleic acid comprising a nucleotide sequence of 
at least 20 consecutive nucleotides selected from the group 
consisting SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 16, SEQ ID NO: 
18, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ 
ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 
13, SEQ ID NO: 14, SEQ ID NO: 15, and a sequence complementary to 
any of these sequences. 

11. An isolated nucleic acid comprising a nucleotide sequence 
comprising at least 10 consecutive nucleotides from a presenilin 
insert in a plasmid selected from the group consisting of ATCC 
Accession # 97214, ATCC Accession # 97508, ATCC Accession # 97124 
and ATCC Accession # 97428. 

12 . An isolated nucleic acid comprising a nucleotide sequence 
encoding at least one ftinctional domain of a presenilin protein 
selected from the group consisting of a normal presenilin-1 
protein, a mutant presenilin-1 protein, a normal presenilin-2 
protein, and a mutant pre8enilin*2 protein. 

13 . An isolated nucleic acid as in claim 12 wherein said 
f\inctional domain is a presenilin-1 functional domain 
corresponding to a domain selected from the group consisting of a 
presenilin-1 N-terminai, TMl^ TMl-^2, TM2, tM2-#3, TM3, TM3-^, TM4, 
TM4-»5, TM5, TM5-*6, TM6, TM6-»7, TM7, and C- terminal domain. 

14. An isolated nucleic acid as in claim 12 wherein said 
fxinctional domain is a presenilin-2 functional domain 
corresponding to a domain selected from the group consisting of a 
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presenilin-2 N-terminal, TMl, TM1^2, TM2, TM2-»3, TM3, TM3^4, TM4, 
TM4-»5, TM5, TM5-»6 , TM6 , TM6-^7, TM7, and C- terminal domain. 

15. An isolated nucleic acid comprising a nucleotide sequence 
encoding an antigenic determinant of a presenilin protein 
selected from the group consisting of a normal presenilin- 1 
protein, a mutant presenilin- 1 protein, a normal presenilin-2 
protein, and a mutant presenilin-2 protein - 

16. An isolated nucleic acid as in claim 15, wherein said 
sequence encodes a presenilin-1 antigenic determinauit 
corresponding to a presenilin-1 antigenic determinant selected 
from the group consisting of amino acid residues 27-44, 28-61, 
46-48, 50-60, 65-71, 66-67, 107-111, 109-112, 120-121, 120-122, 
125-126, 155-160, 185-189, 214-223, 218-221, 220-230, 240-245, 
241-243, 267-269, 273-282, 300-370, 302-310, 311-325, 332-342, 
346-359, 372-382, 400-410 and 400-420 of SEQ ID NO: 2. 

17. An isolated nucleic acid as in claim 15, wherein said 
sequence encodes a presenilin-2 antigenic determinant 
corresponding to a presenilin-2 antigenic determinant selected 
from the group consisting of amino acid residues 25-45, 50-63, 
70-75, 114-120, 127-132, 162-167, 221-226, 282-290, 310-314, 321- 
338, 345-352, 380-390 and 430-435 of SEQ ID NO: 19. 

18. A method for identifying allelic variants or heterospecif ic 
homologues of a human presenilin gene comprising 

choosing a nucleic acid probe or primer capable of 
hybridizing to a human presenilin gene sequence under stringent 
hybridization conditions; 

mixing said probe or primer with a sample of nucleic acids 
which may contain a nucleic acid corresponding to said variant or 
homologue ; 

detecting hybridization of said probe or primer to said 
nucleic acid corresponding to said varieuit or homologue. 

19. A method as in claim 18 wherein said sample comprises a 
sample of nucleic acids selected from the group consisting of 
human genomic DNA, human mRNA, and human cDNA. 

20., A method as in claim 18 wherein said sample comprises a 
sample of nucleic acids selected from the group consisting of 
mammalian genomic DNA, mammalian mRNA, and mammal icux cDNA. 
21. A method as in claim 18 wherein said sample comprises a 
sample of nucleic acids selected from the group consisting of 
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invertebrate genomic DNA, invertebrate itiRNA, and invertebrate 
cDNA. 

22. A method as in claim 18 further comprising the step of 
isolating said nucleic acid corresponding to said varisuit or 
homologue • 

23. A method as in claim 18 wherein said nucleic acid is 
identified by hybridization. 

24. A method as in claim 18 wherein said nucleic acid is 
identified by PGR amplification. 

25. A method for identifying allelic variants or heterospecif ic 
homologues of a human presenilin gene comprising 

choosing an cuitibody capsUale of selectively binding to a 
human presenilin protein; 

mixing said antibody with a sample of proteins which may 
contain a protein corresponding to said variant or homologue; 

detecting binding of said antibody to said protein 
corresponding to said variant or homologue. 

26. A method as in claim 25 wherein said sample comprises a 
sample of proteins selected from the group consisting of human 
proteins, human fusion proteins, and proteolytic fragments 
thereof. . 

27. A method as in claim 25 wherein said sample con^rises a 
sample of nucleic acids selected from the group consisting of 
mammalian proteins, mammalian fusion proteins, and proteolytic 
fragments thereof.. 

28. A method as in claim 25 wherein said sample comprises a 
sample of nucleic acids selected from the group consisting of 
invertebrate proteins, invertebrate fusion proteins, and 
proteolytic fragments thereof.. 

29. A method as in claim 25 further comprising the step of 
substantially purifying said protein corresponding to variant or 
homologue . 

30. An isolated nucleic acid comprising an allelic variant or a 
heterospecif ic homologue of a human presenilin gene. 

31. An isolated nucleic acid encoding ah allelic varicuit or 
heterospecif ic homologue of a h\aman presenilin protein. 

32 . An isolated nucleic acid as in claim 31 wherein said nucleic 
acid encodes a Drosophila melanoaaster homologue of a human 
presenilin gene. 



C! ipcrm in: cuccr /di tt c -5c\ 



wo 96/34099 



9 PCT/CA96/00263 



- 163 - 

33. An isolated nucleic acid as in claim 32 wherein said nucleic 
acid comprises a nucleotide secpience selected from the group 
consisting of 

(1) a sequence encoding a protein comprising the DmPS amino 
acid sequence of SEQ ID NO: 21; 

(2) a sequence encoding a presenilin homologue protein and 
capable of hybridizing to a sequence complementary to the 
sequence of (1) under stringent hybridization conditions. 

34. An isolated nucleic acid comprising a nucleotide sequence of 
at least 10 consecutive nucleotides selected from the group 
consisting of SEQ ID NO: 21 and a sequence complementary to SEQ 
ID NO: 21. 

35. An isolated nucleic acid comprising a recombinant vector 
including a nucleotide sequence of any one of claims 1-34, 

36. An isolated nucleic acid as in claim 35 wherein said vector 
is an expression vector and said presenilin nucleotide secpience 
is operably joined to a regulatory region. 

37. An isolated nucleic acid as in claim 36 wherein said 
expression vector may express said presenilin sequence in 
mammalian cells. 

38. An isolated nucleic acid as in claim 37 wherein said cells 
are selected from the group consisting of fibroblast, liver, 
kidney, spleen, bone marrow and neurological cells. 

39. An isolated nucleic acid as in claim 37 wherein said vector 
is selected from the group consisting of vaccinia virus, 
adenovirus, retrovirus, neurotropic viruses and Herpes simplex. 

40. An isolated nucleic acid as in claim 36 wherein said 
expression vector encodes at least a functional domain of a 
presenilin protein selected from the group consisting of normal 
presenilin^l, mutant presenilin-1, normal presenilin-2, and 
mutant presenilin-2. 

41. An isolated nucleic acid as in claim 36 wherein said vector 
further comprises sequences encoding an exogenous protein 
operaibly joined to said presenilin sequence and whereby said 
vector encodes a presenilin fxosion protein. 

42. An isolated nucleic acid as in claim 41 wherein said 
exogenous protein is selected from the group consisting of lacZ, 
trpE, maltose -binding protein, poly-His tags or glutathione- S- 
transf erase. 
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43. An isolated nucleic acid comprising a recombinant expression 
vector including nucleotide sequences corresponding to an 
endogenous regulatory region of a presenilin gene. 

44. An isolated nucleic acid as in claim 43 wherein said 
endogenous regulatory region is operably joined to a marker gene. 

45. A host cell transformed with an expression vector of any one 
of claims 36-44, or a descendant thereof, 

46. A host cell as in claim 45 wherein said host cell is 
selected from the group consisting of bacterial cells cuid yeast 
cells . 

47. A host cell as in claim 45 wherein said host cell is 
selected from the group consisting of fetal cells, embryonic stem 
cells, zygotes, gametes, and germ line cells. 

48. A host cell as in claim 45 wherein said cell is selected 
from the group consisting of fibroblast, liver, kidney, spleen, 
bone marrow and neurological cells. 

49. A host cell as in claim 45 wherein said cell is an 
invertebrate cell. 

50. A non-human cuiimal model for Alzheimer's Disease, wherein a 
genome of said amimal, or an ancestor thereof, has been modified 
by at least one recombinant construct, and wherein said 
recombinant construct has introduced a modification selected from 
the group consisting of (1) insertion of nucleotide sequences 
encoding at least a functional domain of a heterospecif ic normal 
presenilin gene, (2) insertion of nucleotide sequences encoding 
at least a functional domain of a heterospecif ic mutant 
presenilin gene, (3) insertion of nucleotide sequences encoding 
at least a functional domain of a conspecific homologue of a 
heterospecif ic mutant presenilin gene, and (4) inactivation of an 
endogenous presenilin gene. 

51. An animal as in claim 50 wherein said modification is 
insertion of a nucleotide sequence encoding at least a functional 
domain of a normal human presenilin- l gene. 

52. An animal as in claim 50 wherein said modification is 
insertion of a nucleotide sequence encoding at least a functional 
domain of a mutant human presenilin- 1 gene. 

53. An animal as in claim 50 wherein said modification is 
insertion of a nucleotide sequence encoding at least a functional 
domain of a normal human presenilin- 2 gene. 
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54. An animal as in claim 50 wherein said modification is 
insertion of a nucleotide sequence encoding at least a fiinctional 
domain of a mutant human pre6enilin-2 gene. 

55. An animal as in claim 50 wherein said modification is 
insertion of a nucleotide sequence encoding at least a functional 
domain of a normal or mutant htunan presenilin protein. 

56. An animal as in claim 50 wherein said animal is selected 
from the group consisting of rats, mice, hamsters, guinea pigs, 
rabbits, dogs, cats, goats, sheep, pigs, and non-human primates. 

57. An animal as in claim 50 wherein said animal is an 
invertebrate . 

58. A method for producing at least a functional domain of a 
presenilin protein comprising culturing a host cell of any of 
claims 45-49 under suitable conditions to produce said presenilin 
by expressing said nucleic acid. 

59. A substantially pure preparation of a protein selected from 
the group consisting of a normal presenilin- 1 protein, a mutant 
presenilin-1 protein, a normal presenilin-2 protein, and a mutant 
presenilin- 2 protein. 

60. A substantially pure preparation as in claim 59 wherein said 
protein comprises a normal presenilin- 1 protein selected from the 
group consisting of 

(1) a protein comprising the amino acid sequence of SEQ ID 

NO: 2; 

(2) a protein comprising the amino acid sequence of SEQ ID 

NO: 4; 

(3) a protein comprising the amino acid sequence of SEQ ID 
NO: 17; 

(4) a protein comprising the amino acid of secjuence of SEQ 
ID NO: 2 wherein residue 257 is replaced by alcuiine and residues 
258-290 are omitted; and 

(5) a protein comprising the amino acid of sequence of SEQ 
ID NO: 4 wherein residue 253 is replaced by alanine and residues 
254-286 are omitted. 

61. A substantially pure preparation as in claim 59 wherein said 
protein comprises a mutant presenilin- 1 protein including at 
least one mutation which corresponds to a mutation of SEQ ID NO: 

2 selected from the group consisting of A79?, V82L,V96F, Y115H, 
M139T, M139V, I143T, M146L, M146V, H163R, H163Y, . L171P, G209V, 
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I211T, A231T, A246E, A260V, C263R, P264L, P267S, E280A, E2B0G, 
A285V, L286V, A291-319, G384A, L392V and CAIOY; and 

wherein said protein otherwise corresponds to an amino acid 
sequence selected from the group consisting of 

(1) an amino acid sequence of SEQ ID NO: 2; 

(2) an amino acid sequence of SEQ ID NO: 4; 

(3) an amino acid sequence of SEQ ID NO: 17; 

(4) an amino acid of sequence of SEQ ID NO: 2 wherein 
residue 257 is replaced by alanine and residues 258-290 are 
omitted; cuid 

(4) an amino acid of sequence of SEQ ID NO: 4 wherein 
residue 253 is replaced by alanine and residues 254-286 are 
omitted. 

62. A substantially pure preparation as in claim 59 wherein said 
protein comprises a normal presenilin-2 protein selected from the 
group consisting of 

(1) a protein comprising the amino acid sequence of SEQ ID 
NO: 19; and 

(2) a protein comprising the amino acid of sequence of SEQ 
ID NO: 19 wherein residues 263-296 are omitted. 

63. A substantially pure preparation as in claim 59 wherein said 
protein comprises a mutant presenilin-2 protein including at 
least one mutation which corresponds to a mutation of SEQ ID NO: 
19 selected from the group consisting of M239V, NI41I and I420T; 
and 

wherein said protein otherwise corresponds to an amino acid 
sequence selected from the group consisting of 

(1) an amino acid sequence of SEQ ID NO: 19; and 

(2) an amino acid of sequence of SEQ ID NO: 19 wherein 
residues 263-296 are omitted. 

64. A substantially pure preparation of a polypeptide comprising 
an amino acid sequence of at least 5 consecutive amino acid 
residues selected from the group consisting SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 17, SEQ ID NO: 19, and SEQ ID NO: 21. 

65. A substantially pure preparation of a polypeptide comprising 
an amino acid sequence of at least 10 consecutive amino acid 
residues selected from the group consisting SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 17, SEQ ID NO: 19, and SEQ ID NO: 21. 

66. A subst£uitially pure preparation of a polypeptide comprising 
an amino acid sequence of at least 15 consecutive amino acid 
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residues selected from the group consisting SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 17, SEQ ID NO: 19, and SEQ ID NO: 21. 

67. A substantially pure preparation of a polypeptide comprising 
at least one fvinctional domain of a presenilin protein selected 
from the group consisting of a normal presenilin- 1 protein, a 
mutant presenilin-1 protein, a normal presenilin-2 protein, and a 
mutemt presenilin-2 protein. 

68. A stibstcuitially pure preparation as in claim 67 wherein said 
functional domain is a presenilin- 1 functional domain 
corresponding to a domain selected from the group consisting of a 
presenilin- 1 N- terminal, TMl, TMl-^2, TM2, TM2-»3, TM3, TM3-^, TM4, 
TM4-»5, TM5, TM5-»6, TM6, TM6-^7, TM7 , and C- terminal domain. 

69. A substantially pxire preparation as in claim 67 wherein said 
functional domain is a presenilin-2 ftinctional domain 
corresponding to a domain selected from the group consisting of a 
presenilin-2 N- terminal, TMl, TMl-*2, TM2, TM2-»3, TM3 , TM3-^, TM4, 
TM4-*5, TM5, TM5->6, TM6 , TM6-*7, TM7, auid C- terminal domain. 

70. A sxibstantially pure preparation of a polypeptide comprising 
an antigenic determinant of a presenilin protein selected from 
the group consisting of a normal presenilin-1 protein, a mutant 
presenilin-l protein, a normal presenilin-2 protein, and a mutant 
presenilin-2 protein. 

71. A substantially pure preparation as in claim 70, wherein 
said polypeptide comprises a presenilin-1 suitigenic determinant 
corresponding to a presenilin-l antigenic determinemt selected 
from the group of nucleotide consisting of amino acid residues 
27-44, 28-61, 46-48, 50-60, 65-71, 66-67, 107-111, 109-112, 120- 
121, 120-122, 125-126, 155-160, 185-189, 214-223, 218-221, 220- 
230, 240-245, 241-243, 267-269, 273-282, 300-370, 302-310, 311- 
325, 332-342, 346-359, 372-382, 400-410 and 400-420 of SEQ ID NO: 
2. 

72. A substcuitially pure preparation as in claim 70, wherein 
said polypeptide comprises a presenilin-l antigenic determinemt 
corresponding to a presenilin-1 antigenic determinant selected 
from the group of nucleotide consisting of amino acid residues 
25-45, 50-63, 70-75, 114-120, 127-132, 162-167, 221-226, 282-290, 
3lb-314, 321-338, 345-352, 380-390 and 430-435 of SEQ ID NO: 19. 

73 . A method of producing antibodies which selectively bind to a 
presenilin comprising the steps of 
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administering an immunogenically effective amoxint of a 
presenilin immunogen to cm animal; 

allowing said animal to produce antibodies to said 
immunogen ; and 

obtaining said antibodies from said animal or from a cell 
culture derived therefrom. 

74 . A sxibstantially pure preparation of an antibody which 
selectively binds to an antigenic determinant of a presenilin 
protein selected from the group consisting of a normal 
presenilin- 1, a mutant presenilin- 1, a normal presenilin-2, and a 
mutant presenilin-2. 

75. A substantially pure preparation of an antibody as in claim 
74 wherein said antibody selectively binds to an antigenic 
determinant of a mutant presenilin- 1 cuid fails to bind to a 
normal presenilin- 1 protein. 

76. A substantially pure preparation of an antibody as in claim 
74 wherein said antibody selectively binds to an antigenic 
determinant of a mutant presenilin-2 and fails to bind to a 
normal presenilin-2 protein. 

77. A cell line producing an antibody of any one of claims 74- 
76. 

78. A method for identifying compoxinds which can modulate the 
expression of a presenilin gene comprising 

contacting a cell with a test candidate wherein said cell 
includes a regulatory region of a presenilin gene operad^ly joined 
to a coding region; and 

detecting a change in expression of said coding region. 

79. A method as in claim 78 wherein said change comprises a 
change in a level of an mRNA transcript encoded by said coding 
region. 

80. A method as in claim 78 wherein said change comprises a 
change in a level of a protein encoded by said coding region. 

81. A method as in claim 78 wherein said change is a result of 
eui activity of a protein encoded by said coding region. 

82. A method as in claim 7a wherein said. coding region encodes a 
marker protein selected from the group consisting of 0- 
galactosidase, alkaline phosphatase, green fluorescent protein, 
suid lucif erase. 

83. A method for identifying compounds which can selectively 
bind to a presenilin protein comprising the steps of 
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providing a preparation including at least one presenilin 
component ; 

contacting said preparation with a sample including at least 
one candidate compound; and 

detecting binding of said presenilin component to said 
candidate compound. 

84. The method in 83 wherein said binding to said presenilin 
component is detected by an assay selected from the group 
consisting of: affinity chromatography, co-immunoprecipitation, a 
Biomolecular Interaction Assay, and a yeast two-hybrid system. 

85. A method of identifying compounds which can modulate 
activity of a presenilin comprising the steps of 

providing a cell expressing a normal or mutant presenilin 

gene ; 

contacting said cell with at least one ccindidate compound; 

and 

detecting a change in a marker of said activity. 

86. A method as in claim 85 wherein measurement of said marker 
indicates a difference between cells bearing an expressed mutant 
presenilin gene cmd otherwise identical cells free of an 
expressed mutant presenilin gene. 

87. A method as in claim 85 wherein said change comprises a 
cheuige in a non-specific marker of cell physiology selected from 
the group consisting of pH, intracellular calcium, cyclic AMP 
levels, GTP/GDP ratios, phosphatidylinositol activity, and 
protein phosphorylation. 

88. A method as in claim 85 wherein said change comprises a 
change in expression of said presenilin. 

89. A method as in claim 85 wherein said change comprises a 
change in intracellular concentration or flux of an ion selected 
from the group consisting of Ca^*, Na* and K*. 

90. A method as in claim 85 wherein said change comprises a 
change in occurrence or rate of apoptosis or cell death. 

91. A method as in claim 85 wherein said change comprises a 
change in production of AjS peptides. 

92. A method as in claim 85 wherein said change comprises a 
change in phosphorylation of at least one microtubule associated 
protein. 

93. A method as in claim 85 wherein said cell is a cell cultured 
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94. A method as in claim 93 wherein said cell is a transformed 
host cell of any one of claims 45-49. 

95. A method as in claim 93 wherein said cell is explanted from 
a host bearing at least one mutant presenilin gene. 

96. A method as in claim 93 wherein said cell is explanted from 
a transgenic cuiimal of any one of claims 50-57. 

97. A method as in claim 85 wherein said cell is a cell in a 
live animal. 

98. A method as in claim 97 wherein said cell is a cell of a 
transgenic animal of any one of claims 50-57. 

99. A method as in claim 85 wherein said cell is in a human 
subject in a clinical trial. 

100. A diagnostic method for determining if a subject bears a 
mutant presenilin gene comprising the steps of 

providing a biological sample of said subject; 
detecting in said sample a mutant presenilin nucleic acid, a 
mutant presenilin protein, or a mutant presenilin activity. 

101. A method as in claim 100, wherein a mutant presenilin 
nucleic acid is detected by an assay selected from the group 
consisting of direct nucleotide sequencing, probe specific 
hybridization, restriction enzyme digest cuid mapping, PGR 
mapping, ligase -mediated PGR detection, RNase protection, 
electrophoretic mobility shift detection, and chemical mismatch 
cleavage . 

102. A method as in claim 100, wherein a mutant presenilin 
protein is detected by an assay selected from the group 
consisting of an immunoassay, a protease assay, and an 
electrophoretic mobility assay. 

103 . A pharmaceutical preparation comprising a substantially pure 
presenilin protein and a pharmaceutical ly acceptable carrier. 

104. A pharmaceutical preparation comprising an expression vector 
operably encoding a presenilin protein, wherein said expression 
vector may express said presenilin protein in a human subject > 
and a pharmaceutically acceptable carrier. 

105. A pharmaceutical preparation comprising an expression vector 
operably encoding a presenilin ant i sense sequence, wherein said 
expression vector may express said presenilin antisense sequence 
in a human subject, amd a pharmaceutically acceptable carrier. 
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106. A pharmaceutical preparation comprising a substantially pure 
antibody, wherein said antibody selectively binds to a mutant 
presenilin protein, and a pharmaceutically acceptable carrier. 

107. A pharmaceutical preparation as in claim 106 wherein said 
preparation is essentially free of an antibody which selectively 
binds a normal presenilin protein. 

108. A pharmaceutical preparation comprising a substantially pure 
preparation of an antigenic determinant of a mutant presenilin 
protein. 

109. A pharmaceutical preparation as in claim 108 wherein said 
preparation is essentially free of an antigenic determinant of a 
normal presenilin protein. 

110. A method of treatment for a patient bearing a mutant 
presenilin gene comprising the step of administering to said 
patient a therapeutically effective amount of the pharmaceutical 
preparation of any one of claims 103-109. 

111. A method as in claim 110, wherein said pharmaceutical 
preparation is targeted to a cell type is selected from the group 
consisting of heart, brain, lung, liver, skeletal muscle, kidney, 
pancreas and neurological cells. 
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